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4. US-07-800-364A-13 (1-1003) 

HUM0P1 Human OP-1 mRNA for osteogenic protein 


LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 
ORGANISM 


HUM0P1 1878 bp ss-mRNA PRI 29-MAY-1991 

Human OP-1 mRNA for osteogenic protein 
X51801 

OP-1 gene; osteogenic protein. 
Homo sapiens RNA. 
Homo sapiens 

Eukaryota; Animalia; Chordata; Vertebrata; Mammalia; Theria; 
Eutheria; Primates; Haplorhini; Catarrhini; Horoinidae. 
1 (bases 1 to 1878) 
Oppermann, H. 
Unpublished (1990) 
full automatic 
£ (bases 1 to 1878) 

Oezkaynak, E. , Rueger, D. C. , Drier, E. A., Corbett,C. , Ridge, R.J. , 
Sampath,T.K. and Oppermann, H. 

OP-1 cDNA encodes an osteogenic protein in the TGF-beta family 
EMBO J. 9, £085-2093 (1990) 
full automatic 
*source : t issue=placenta; 
From EMBL entry HS0P1; 
Locat ion/Qua 1 i f iers 
signal 1850. . 1855 

/gene="BMP3" 
polyA_signal 1862. . 1867 


REFERENCE 
AUTHORS 
JOURNAL 
STANDARD 

REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
STANDARD 
COMMENT 

FEATURES 

polyA_ 


dated 12-SEP-1990. 


-1- 


/gene="BMP3" 
polyA_site 1878 

/gene="BMP3" 
CDS 123. .1418 

/gene="BMP3" 

/note="pre-propolypeptide; (AA -29 to 482)" 
/codon_5tart=l 

/translat ion="MHVRSLRAAAPHSFVALWAPLFLLRSALADFSLDNEVHSSFIHR 

RLRSQERREMQREILSILGLPHRPRPHLQGKHNSAPMFMLDLYNAMAVEEGGGPGGQG 

FSYPYKAVFSTQ6PPLASLQDSHFLTDADMVMSFVNLVEHDKEFFHPRYHHREFRFDL 

SK I PEGEA VTAAEFR I YKDY I RERFDNETFR I SV YQVLQEHLGRESDLFLLDSRTLWO 

SEEGWLVFDITATSNHWVVNPRHNLGLQLSVETLDGQSINPKLAGLIGRHGPQNKQPF 

MVflFFKflTEVHFRSIRSTGSKQRSQNRSKTPKNQEALRMflNVflENSSSDQRQACKKHE 

LYVSFRDLGWQDWI IAPEGYAAYYCEGECAFPLNSYMNATNHAIUQTLVHFINPETVP 

KPCCAPTQLNA I S VLYFDDSSN V I LKKYRNM V V RAC6CH " 
CDS 123. . 206 

/gene="BMP3" 
/partial 

/note="signal peptide; (AA -29 to -1)" 
/codon_start=l 

/translat ion="MHVRSLRAAAPHSFVALWAPLFLLRSAL" 
CDS 210. . 1034 

/gene="BMP3" 
/partial 

/note="propeptide; (AA 1-276)" 
/codon_st art=l 

/tran s 1 at i on= " DFSLDNEVHSSF I HRRLRSQERREMQRE I LS I LGLPHRPRPHLQ 

GKHNSAPMFMLDLYNAMAVEEGGGPGGQGFSYPYKAVFSTQGPPLASLQDSHFLTDAD 

M VMSF VNL VEHDKEFFHPRYHHREFRFDLSK I PEGEAVTAAEFR I YKDY I RERFDNET 

FR I SVYQ VLQEHLGRESDLFLLDSRTLWASEEGWL VFD I TATSNHWV VNPRHNLGLQL 

SVETLD6QSINPKLAGLIGRHGPQNKQPFMVAFFKATEVHFRSIRSTGSKQRSQNRS" 
CDS 210. .1017 

/gene="BMP3" 
/partial 

/not e=" propeptide; (alt.) (AA 1-270)" 
/codon start=l 


/tran5lation="DFSLDNEVHSSFIHRRLRSQERREMQREILSILGLPHRPRPHLQ 


GKHNSAPMFMLDLYNAMAVEEGGGPGGQGFSYPYKAVFSTQGPPLASLQDSHFLTDAD 


MVMSFVNLVEHDKEFFHPRYHHREFRFDLSKIPEGEAvTAAEFRIYKDYIRERFDNET 

FRISVYQVLQEHLGRESDLFLLDSRTLWftSEEGWLVFDITftTSNHWVVNPRHNLGLQL 

SVETLDGQS I NPKLAGL I GRHGPQNKQPFM V AFFKATE VHF RS I RSTGBKQ 
CDS 1021.. 1415 

/gene="BMP3" 
/part ial 

/note="mature osteogenic protein; <AA 271-402)" 
/codon_start=3 

/translation="QNRSKTPKNQEALRMANVAENSSSDQRQACKKHELYVSFRDLGW 

QDWI IAPEGYAAYYCEGECAFPLNSYMNATNHAI VQTLVHFINPETVPKPCCAPTQLN 

AISVLYFDDSSNVILKKYRNMVVRACGCH" 
CDS 1038. . 1418 

/gene="BMP3" 
/partial 

/note="mature osteogenic protein; <AA 277-402)" 
/codon_start=l 

/ t ran s 1 at i on= nPKNQEALRMAN V AENSSSDQRQACKKHELY VSFRDLGWQDW I I 

APEGYAAYYCEGECAFPLNSYMNATNHAI VQTL VHF I NPET V PKPCCAPTQLNA I SVL 

YFDDSSNV I LKKYRNM V V RAC6CH " 
BASE COUNT 411 a 592 c 541 g 334 t 

ORIGIN 


Initial Score = 513 Optimized Score = 630 Significance = 30.73 

Residue Identity = 645C Matches = 670 Mismatches = 303 

Gaps = 59 Conservative Substitutions = 0 

X 10 20 30 40 50 60 


GAATTCC6 — AGCCCCATT6GAAGGAGTTCCGCTTTGACCTGACCCAGATCCC66CT6GGGA 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
GGAATTCTTCCACCCACGCTACCACCATCG — AG-AGTTCCGGTTTGATCTTTCCAAGATCCCAGAAGGGGA 
560 X 570 580 590 600 610 620 

70 80 90 100 110 120 

GGCGGTCACAGCTGCGGAGTTCC6GATTTACAA6GTGCCCAGCATCC AC-CTGC-TCAACAGGACCCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III III I 
AGCTGTCACGGCAGCCGAATTCCGGATCTACAAG — GACTA-CATCCG6GAACGCTTCGACAATGAGACGTT 
630 640 650 660 670 680 690 

130 140 150 160 170 180 190 

CCACGTCAGCATGTTCCAG6TGGTCCAGGAGCAGTCCAACAGGGAGTCTGACTTGTTCTTTTTGGATCTTCA 
II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 
CCGGATCAGC6TTTATCAGGTGCTCCA6GAGCACTTGGGCAGG6AATCGGATCTCTTCCTGCTCGACAGCCG 


700 710 720 730 740 750 760 

£00 210 220 230 240 250 260 270 

SAC6CTCCBA6CTGGAGACGA66GCTG6CTGGT6CT6GAT6TCACAGCAGCCAGTGACTGCTGGTT6CTGAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
TflCCCTCTGGGCCTCGGfiGGflGGBCTBBCTGGTBTTTGftCfiTCflCABCCfiCCflSCfifiCCftCTBGGTSGTCflA 
770 780 790 800 810 820 830 

280 290 300 310 320 330 

GCGTCACAflGGflCCTGGGflCTCCGCCTCTflTGTGGfiGACTGftGGATGGGCACAGCGTGBflTCCTBGCCT 

ill I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I III I 

TCCBCGGCACA ACCTGGGCCTGCAGCTCTCGGTG6AGACGCTGGATGG6CA6AGCATCAACCCCAAGTT 

840 850 860 870 880 890 900 

340 350 360 370 380 390 400 

BGCCGGCCTGCTGGGTC — AAC6SGCCCCAC6CTCCCAACAGCCTTTCGTGGTCACTTTCTTCABGSCCAGT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
G6CGG6CCTGATTG6GCGGCACG66CCCCA-GAAC-AAGCAGCCCTTCATGGTGGCTTTCTTCAAGGCCA — 
910 920 930 940 950 960 970 

410 420 430 440 450 460 470 

CCGA-GTCC-CATCCGCACCCCTCGGGCAGTGA66CCACTGAGGAGGAG6CAGCCGAAGAAAAGCAACGAGC 
I I I I I I I I I I I I I I I I I I I III I I II I I I I I I I I I I II III 
CGGAGGTCCACTTCCGCAGCATCCGGTCCACGGGG — AGCAAACAG — CGCAGCC — AGAACCGCTCCAAGA 

980 990 1000 1010 1020 1030 

480 490 500 510 520 530 540 

TGCCGCA66CCAACCGACTCCCA66GAT — CTTTGATGAC-GTCCACGGCTCCCACGGCC GGCAGGTCT 

III II II I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGCCCAA6AACCAGGAAGCCCTGCG6ATGGCCAACGTGGCAGAGAACAGCAGCAGCGACCAGA6GCAGGCCT 

1040 1050 1060 1070 1080 1090 1100 1110 

550 560 570 580 590 600 610 

GCCGTCGGCACGAGCTCTACGTCAGCTTCCAGGACCTTGGCT6GCTGGACTGGGTCATCGCCCCCCAA6GCT 

I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
GTAA6AAGCACGAGCTGTATGTCAGCTTCCGAGACCTGGGCT6GCAGGACTGGATCATCGCGCCTGAAGGCT 

1120 1130 1140 1150 1160 1170 1180 

620 630 640 650 660 670 680 

ACTCAGCCTATTACTGTGAGBBBBASTGCTCCTTCCCGCTGGACTCCTGCATGAACGCCACCAACCAC6CCA 

II I Mill I I I I I I I I I I I I I I I I I lllllll III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ACGCCGCCTACTACTGTGAGGGGGAGTGTGCCTTCCCTCTGAACTCCTACATGAACGCCACCAACCACGCCA 

1190 1200 1210 1220 1230 1240 1250 

690 700 710 720 730 740 750 

TCCTGCAGTCCCTGGTGCACCTGATGAAGCC — AAACGCAGTCCCCAAGGCGTGCTGTGCACCCACCAAGCT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
TCGTGCAGACGCTGGTCCACTTCATCAACCCGGAAAC6~STSCCCAA6CCCTBCT6TSC6CCCACBCASCT 
1260 1270 1280 1290 1300 1310 1320 


-4- 


760 770 780 790 800 810 820 830 

GAGC6CCACCTCTGTGCTCTACTATGACAGCAGCAACAACGTCATCCTGCGCAAGCACCGCAACAT6GTGGT 
I I I I I I I I I I I I I I I I I II I I I I 1 I I I I I I I I I I I I II I I I I I I I I I I I I I I 
CflftTGCCflTCTCCGTCCTCTflCTTCGftTGflCfiGCTCCAflCGTCftTCCTGAflGfiflflTACflGflftflCflTGGTGGT 
1330 1340 1350 1360 1370 1380 1390 

840 850 860 870 880 890 900 

CAAGGCCTGCG6CTGCCACTGAGTCAGCC CGCCCAGCCCTACT6CAGCCACCCTTCTCATCTGGATCGG 

I I I I I I I I I I I I I I I I I II II I I I I I II I I I I II I I I I I I I I 

CCG66CCTGT6GCTGCCACTAGCTCCTCCGAGAATTCA6ACCCTTTGGGGCCAAGTTT TTCTGGATCCT 

1400 1410 1420 1430 1440 1450 1460 

910 920 930 940 950 960 970 

GCCCTGCAGAG6CAGAAAACCCTTAAATGCTGTCACAGCTCAAGCAGGAGTGTCA6GGGCCCTCACTCTCGG 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

CCATTGC — TCGCCTTGGCCAGGAACCAGCAGAC-CAACTGCCTTTTGTGAGAC-CTTCCCCTCCCTATCCC 

1470 1480 1490 1500 1510 1520 1530 

980 990 1000 X 

TGCCTACTT CCTGTCAGGCTTCTGGGAATTC 

II II I I I I I I I I I I I 

CAACT — TTAAAGGTGTGAGAGTATTAGGAAACATGAGCAGCAT 

1540 1550 1560 X 1570 


7. US-07-800-364A-13 (1-1003) 

HUMTGFBD Human transforming growth factor-beta (tgf-beta) m 


LOCUS 
DEFINITION 

ACCESSION 
KEYWORDS 
SOURCE 
ORGANISM 


REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
STANDARD 
FEATURES 
CDS 


HUMTGFBD 1448 bp ss-mRNA PRI 15-JAN-1991 

Human transforming growth factor-beta (tgf-beta) raRNA, complete 
cds. 

M60316 M38693 M38695 
transforming growth factor-beta. 
Human cell line U-2 OS, cDNA to mRNA. 
Homo sapiens 

Eukaryota; Animalia; Chordata; Vertebrata; Mammalia; Theria; 
Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
1 (bases 1 to 1448) 

Celeste, A. J. , Iannazzi, J. A. , Taylor, R. C. , Hewick, R. M. , Rosen, V. , 
Wang, E. A. and Wozney,J. M. 

Identification of new tgf-beta family members present in 
bone-inductive protein purified from bovine bone 
Proc. Natl. Acad. Sci. U.S.A. 87, 9843-9847 (1990) 
full automatic 

Locat i on/Qua 1 i f i ers 

97. . 1392 

/gene="tgf-beta" 

/product="transf orming growth factor-beta" 
/codon start=l 


/tran s 1 at i on= "MHVRSLRAAAPHSFVALWAPLFLLRSALADFSLDNEVHSSF I HR 


RLRSQERREMQREILSILGLPHRPRPHLQGKHNSAPMFMLDLYNAMAVEEGGGPGGQG 

FSYPYKAVFSTQGPPLASLQDSHFLTDADMVMSFVNLVEHDKEFFHPRYHHREFRFDL 

SK I PEGEA VTAAEFR I YKDY I RE RFDNETF R I S V YQ VLQEHLGRESDLFLLDSRTL WA 

SEEGWL VFD I TATSNHW V VNPRHNLGLQLS VETLDGQS I NPKLAGL 1 6RHGPQNKQPF 

MVAFFKATEVHFRSIR5TGSKQRSQNRSKTPKNQEALRMANVAENSSSDQRQACKKHE 

LYVSFRDLGWQDWI I APEGYAAYYCE6ECAFPLNSYMNATNHA I VQTLVHFINPETVP 

KPCCAPTQLNA I S VL YFDDSSN V ILKKYRNMVV RACGCH " 
BASE COUNT 285 a 490 c 422 g 251 t 

ORIGIN 

Initial Score = 496 Optinized Score = 597 Significance = 29.65 
Residue Identity = 67% Matches = 630 Mismatches = 255 

Gaps = 50 Conservative Substitutions = 0 

X 10 20 30 40 50 60 

GAATTCCG — AGCCCCATTGGAAGGAGTTCCGCTTTGACCTGACCCA6ATCCCGGCT66G6A 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
G6AATTCTTCCACCCACGCTACCACCATCG — AG-AGTTCCGGTTTGATCTTTCCAAGATCCCAGAAG66GA 
530 540 550 560 570 580 590 

70 80 90 100 110 120 

GGCGGTCACAGCTGCGGAGTTCCGGATTTACAAGGTGCCCAGCATCC AC-CTGC-TCAACAGGACCCT 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I III III I 
AGCTGTCACGGCAGCCGAATTCCGGATCTACAA6 — GACTA-CATCCG6GAACGCTTCGACAATGAGACGTT 

600 610 620 630 640 650 660 

130 140 150 160 170 180 190 

CCAC6TCAGCATGTTCCAGGTGGTCCAGGAGCA6TCCAACAGGGA6TCTGACTTGTTCTTTTTGGATCTTCA 
II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 
CCG6ATCAGCGTTTATCA6GTGCTCCAGGAGCACTTGGGCAGGGAATCGGATCTCTTCCTGCTCGACAGCCG 
670 680 690 700 710 720 730 

200 210 220 230 240 250 260 270 

GACGCTCCGAGCT6GAGACGAGGGCTGGCTGGTGCTGGATGTCACAGCAGCCAGTGACTGCTGGTTGCTGAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
TACCCTCTGGGCCTCGGAGGAGGGCTGGCT6GT6TTTGACATCACAGCCACCAGCAACCACTGGGTGGTCAA 
740 750 760 770 780 790 800 

280 290 300 310 320 330 

GCGTCACAAGGACCTGGGACTCC6CCTCTATGTGGAGACTGAGGATGGGCACAGCGTGGATCCTGGCCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 
TCCGCGGCACA ACCTGGGCCTGCAGCTCTCGGTGGAGACGCTGGATGGGCAGAGCATCAACCCCAAGTT 


810 820 


840 850 860 870 


340 350 3&0 370 380 390 400 

GGCCGGCCTGCTGGGTC — AACGGGCCCCACGCTCCCAACA6CCTTTCGT6GTCACTTTCTTCAGGGCCAGT 
I I I 1 I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGCGGGCCTGfiTTGGGCGGCfiCGGGCCCCfl-GflflC-flflGCflGCCCTTCflTGGTGGCTTTCTTCftflGGCCfl — 
880 890 900 910 920 930 940 

410 420 430 440 450 460 470 

CCGft-GTCC-CflTCCGCflCCCCTCGGGCflGTGflGGCCftCTGflGGflGGfiGGCfiGCCGfiftGflflflfiGCflftCGflGC 
I I I I I I I I I I I I I I I I I I I III I I II I I I I I I I II I II III 
C6Gfl6GTCCflCTTCCGCflGCftTCCGGTCCflC6GGG~fiGCflflflCfiG~CGCfl6CC~ftGflflCCGCTCCflflGfl 
950 9&0 970 980 990 1000 1010 

480 490 500 510 520 530 540 

TGCCGCAGGCCAACCGACTCCCAGGGAT — CTTTGATGAC-GTCCACGGCTCCCACGGCC GGCAGGTCT 

III II II I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGCCCAAGAflCCAGGAAGCCCTGCGGfiTGGCCAfiCGTGGCflGAGflflCflGCAGCOGCGACCflGAGGCAGGCCT 

1020 1030 1040 1050 1060 1070 1080 

550 560 570 580 590 600 610 

GCCGTCGGCAC6A6CTCTACGTCAGCTTCCAGGACCTTGGCTGGCTGGACTG66TCATCGCCCCCCAA6GCT 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
GTAAGAAGCACGAGCT6TAT6TCAGCTTCCGAGACCTGGGCTGGCAGGACT6GATCATCGC6CCTGAA66CT 

1090 1100 1110 1120 1130 1140 1150 

620 630 640 650 660 670 680 

ACTCAGCCTATTACTGTGAGGGGGAGTGCTCCTTCCCGCTGGACTCCTGCATGAACGCCACCAACCACGCCA 

ii i iiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mini iii linn in i m 1 1 m i m 1 1 1 m i 

ACGCCGCCTACTACTGTGAGGGGGAGTGTGCCTTCCCTCTGAACTCCTACATGAACGCCACCAACCACGCCA 
1160 1170 1180 1190 1200 1210 1220 

690 700 710 720 730 740 750 

TCCTGCAGTCCCTGGTGCACCTGATGAAGCC — AAACGCAGTCCCCAAGGC6TGCTGTGCACCCACCAAGCT 

II I I III I I I I I I III I II II II lllll II III III I III III I I IIIII I I I I 
TC6T6CAGACGCTGGTCCACTTCATCAACCCGGAAACG — GTGCCCAAGCCCTGCTGTGCGCCCAC6CAGCT 

1230 1240 1250 1260 1270 1280 1290 

760 770 780 790 800 810 820 830 

GAGCGCCACCTCTGTGCTCTACTATGACAGCAGCAACAACGTCATCCTGCGCAAGCACCGCAACATGGTGGT 
I I I I I I I I I I I III III II I I I I I I I I I I I III I I I II I I I I I I III I I I II 
CAATGCCATCTCCGTCCTCTACTTCGATGACAGCTCCAACGTCATCCTGAAGAAATACAGAAACATGGTGGT 

1300 1310 1320 1330 1340 1350 1360 1370 

840 850 860 870 880 890 900 

CAAGGCCTGCGGCTGCCACTGAGTCAGCC CGCCCAGCCCTACTGCAGCCACCCTTCTCATCTGGATCGG 

I I I I III III I I I III I II II lllll II I III II I I I III I I 

CCGGGCCTGTGGCTGCCACTA6CTCCTCCGAGAATTCAGACCCTTTGGGGCCAAGTTT TTCTGGATCCT 

1380 1390 1400 1410 1420 1430 


-7- 


910 

GCCCTGCAGAGGCAGAAAA 

I I I I 
CCATTGCTC 
144© X 


8. US-07-800-364A-13 (1-1003) 

MUS0P1G Mouse OP-1 niRNA for osteogenic protein 1 

LOCUS MUSOP1G 1872 bp ss-mRNA ROD 10-OCT-1991 

DEFINITION Mouse OP-1 mRNA for osteogenic protein 1 
ACCESSION X56906 

KEYWORDS OP-i gene; osteogenic protein. 
SOURCE Mus musculus RNA. 

ORGANISM Mus musculus 

Eukaryota; Animalia; Chordata; Vertebrata; Mammalia; Theria; 
Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 1872) 

AUTHORS Ozkaynak,E. , Schnegelsberg, P. N. and Oppermann,H. 
TITLE Murine osteogenic protein (OP-1): High levels of roRNA in kidney 
JOURNAL Biochem. Biophys. Res. Commun. 179, 116-123 (1991) 
STANDARD full automatic 
COMMENT Sequenced molecule was a DNA/cDNA composite:- genomic clone bases 
1-465, cDNA clone bases 267-1872. 

From EMBL entry MM0P1G; dated 26-SEP-1991. 
FEATURES Locat i on/Qua 1 i f i er s 

sig_pept ide 104. . 190 

/codon_start=l 
mat ^peptide 977. .1393 

/product="osteogenic protein 1" 
/gene^OP-l" 
/codon_start=l 
CDS 104. . 1396 

/gene="OP-l" 

/product = n pre-propept ide" 
/codon_start=l 

/ 1 ran s 1 at i on= " MHV RSLRAAAPHSF V ALWAPLFLLRSAL ADFSLDNE VHSSF I HR 
RLRSQERREMQREILSILGLPHRPRPHLQ6KHNSAPMFMLDLYNAMAVEESGPDGQ6F 
SYPYKAVFSTQGPPLASLQDSHFLTDADMVMSFVNLVEHDKEFFHPRYHHREFRFDLS 
K I PEGER VT AAEFR I YKDY I RERFDNETFQ I T V YQ VLQEHSGRESDLFLLDSRT I WAS 
EEGWL VFD I TATSNHW V VNPRHNLGLGLS VETLDGQS I NPKLAGL I GRH6PQNKQPFM 
VAFFKATEMHLRSIRSTGGKQRSQNRSKTPKNQEALRMASVAENSSSDQRQACKKHEL 


-8~ 


Y VSFRDLGWQDW 1 1 APEGYAAYYCEGECAFPLNSYMNATNHA IVQTLVHFI NPDTVPK 

PCCAPTQLNA I S VL YFDDSSN V ILKKYRNMVv" RAC6CH " 
BASE COUNT 433 a 591 c 501 g 347 t 

ORIGIN 

Initial Score = 489 Optimized Score = 637 Significance = £9.20 
Residue Identity = 65% Matches = 676 Mismatches = 300 

Gaps = 55 Conservative Substitutions = 0 

X 10 20 30 40 50 60 

GAATTCCG — AGCCCCATTGGAAGGAGTTCC6CTTTGACCTGACCCAGATCCC6GCTGG6GA 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AGAATTCTTCCACCCTC6ATACCACCATCG GGAGTTCCGGTTTGATCTTTCCAA6ATCCCCGAGGGCGA 

540 X 550 560 570 580 590 600 


70 80 90 100 110 120 

6GCGGTCACAGCTGC6GAGTTCCGGATTTACAAGG — TGC — CCAGCATCCACCTGCTCAACAGGACCCTCC 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I 
ACGGGTGACCGCAGCCGAATTCAGGATCTATAAGGACTACATCCGGGAGCGATTTG-ACAACGAGACCTTCC 
610 620 630 640 650 660 670 


130 140 150 160 170 180 190 200 

ACGTCAGCATGTTCCAG6TGGTCCA6GAGCAGTCCAACAGGGA6TCTGACTTGTTCTTTTTGGATCTTCAGA 
I III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGATCACAGTCTATCAGGTGCTCCAGGAGCACTCAGGCAGGGA6TCGGACCTCTTCTTGCTGGACAGCCGCA 
680 690 700 710 720 730 740 

210 220 230 240 250 260 270 

CGCTCCGAGCTGGAGACGAGGGCTGGCTGGTGCTGGATGTCACAGCAGCCAGTGACTGCTGGTTGCTGAAGC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I 
CCATCTGGGCTTCTGAGGAGG6CTGGTTGGTGTTTGATATCACAGCCACCAGCAACCACTGGGTGGTCAACC 
750 760 770 780 790 800 810 


280 290 300 310 320 330 340 

GTCACAAGGACCTG66ACTCCGCCTCTATGTGGAGACTGAGGATGGGCACAGCGTG6ATCCTGGCCTG6CCG 
II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I 

CTCG6CACAACCTGG6CTTACAGCTCTCTGTGGA6ACCCT6GATGG6CAGAGCATCAACCCCAAGTTGGCAG 
820 830 840 850 860 870 880 


350 360 370 380 390 400 410 

GCCTGCTGGGTC — AACGGGCCCCACGCTCCCAACAGCCTTTCGTGGTCACTTTCTTCAGGGCCAGTCCG-A 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
GCCTGATTGGACGGCATGGACCCCA-GAAC-AAGCAACCCTTCATGGTG6CCTTCTTCAAGGCCA — CGGAA 
890 900 910 920 930 940 950 


420 430 440 450 460 470 480 

GTCC-CATCCGCACCCCTCGG-GCAGTGAGGCCACTGAGGAGGAGGCAGCCGAAGAAAAGCAACGAGCTGCC 
I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I II ill III 

GTCCATCTCCGTAGTATCCGGTCCACGGG6GGCA AGCAG — CGCAGCC — AGAATC6CTCCAAGACGCC 

960 970 980 990 1000 1010 1020 
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490 50© 510 520 530 540 

GCAG-GCCAACCGACTCCCAGG — GATCTTTGATGACGTCCACGGCTCCCACGGCC GGCAGGTCTGCCG 

II I II I III III I I II I I I I I I I III I I I I I I I I I I 
AAAGAACCAAGAGGCCCTGAGGATGGCCAGTGT6GCAGAAAACAGCAGCAGTGACCAGAGGCAGGCCTGCAA 

1030 1040 1050 10&0 1070 1080 1090 

550 560 570 580 590 600 610 620 

TCGGCftCGflGCTCTflCGTCflGCTTCCftGGACCTTGGCTGGCTGGflCTGGGTCflTCGCCCCCCflflGGCTflCTC 
II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II II I I I I I I I I 
GflfiflCfiTGflGCTGTflCGTCfiGCTTCCGfiGflCCTTGGCTGGCfiGGflCTGGflTCftTTGCflCCTGflflGGCTPTGC 
1100 1110 1120 1130 1140 1150 1160 

630 640 650 660 670 680 690 

flGCCTflTTflCTGTGflGGGGGflGTGCTCCTTCCCGCTGGflCTCCTGCATGflACGCCflCCflfiCCflCGCCflTCCT 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGCCTflCTACTGTGflGGGAGflGTGCGCCTTCCCTCTGAACTCCTACflTGAflCGCCACCflACCflCGCCflTCGT 
1170 1180 1190 1200 1210 1220 1230 

700 710 720 730 740 750 760 

6CA6TCCCT6GT6CACCTGATGAAGCCAAACGCAGTCCCCAAGGCGTGCTGTGCACCCACCAA6CT6AGCGC 
III I I I I I I III I II II III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
CCAGACACTGGTTCACTTCATCAACCCAGACACAGTACCCAAGCCCTGCTGTGCGCCCACCCAGCTCAACGC 
1240 1250 1260 1270 1280 1290 1300 

770 780 790 800 810 820 830 

CACCTCTGTGCTCTACTATGACAGCAGCAACAACGTCATCCTGCGCAAGCACCGCAACATGGTGGTCAAGGC 
II I I I I I I I I I I I I I III I I I I II I I I I I I I I I III II I I I I I I I I I I I I I III 
CATCTCTGTCCTCTACTTCGACGACAGCTCTAATGTCATCCTGAAGAAGTACAGAAACATGGTGGTCCGGGC 
1310 1320 1330 1340 1350 1360 1370 1380 

840 850 860 870 880 890 900 

CTGC6GCTGCCACTGA6TCAGCCCG CCCAGCCCTACTGC — AGCCAC-CCTTCTCATCTG6ATCGGGCC 

I I I I I I I I I I I I I II III I I I I I I I III I I I I I I I I I II I III 
CTGTGGCTGCCACTAGCTCTTCCTGAGACCCTGACCT-TTGCGGGGCCACACCTTTCCAAAT-CTTCGATGT 
1390 1400 1410 1420 1430 1440 1450 

910 920 930 940 950 960 970 

CTGCAGAGGCAGAAAACCCTTAAATGCTGTCACAGCTCAAGCAGGA6TGTCAGGGGCCCTCACTCT-CGGTG 
I I I I I II I I I I I I I I I I I I I I I I III III I I I I I I I 

CT-CACCATC-TAAGTCTCTCA CTG-CCCACCTTGGCGAGGA — GCCAACAGACCAACCTCTCCTGAG 

1460 1470 1480 1490 1500 1510 

980 990 1000 X 

CCTACTTCCT6TC — A6GCTTCTGG — GAATTC 
I I I I III I I III III 

CCTTC-CCCTCACCTCCCCAACCGGAAGCATGTAAGGGTTCCA 
1520 1530 1540 X 1550 

> 0 ( 

□I 10 IntelliGenetics 
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FastDB - Fast Pairwise Comparison of Sequences 
Release 5.4 

Results file us-07-800-364a-14-pir. res made by maryh on Thu £1 Jan 93 
19:33:50-PST. 


Query sequence being compared :US-07-800-364A-14 (1-£81) 
Number of sequences searched: 44900 
Number of scores above cutoff: 4198 


Results of the initial comparison of US-07-800-364A-14 (1-281) with: 
Data bank : PIR 34, all entries 




***♦ 53 standard deviations above mean 





1. 

SI 0529 

•Osteogenic protein precursor 431 

13£ 

158 

53. £5 

0 

2. 

C39263 

*Bone morphogenetic protein 7 431 
**** 51 standard deviations above mean 

13£ 
###* 

158 

53. £5 

0 

3. 

JQ11B4 

Osteogenic protein 1 precurso 430 
**** 50 standard deviations above mean 

132 

155 

51.86 

0 

4. 

039263 

*Bone morphogenetic protein 5 454 

111 

153 

50. 93 

0 

5. 

033925 

*vg-l-relat ed protein precurs 438 

128 

15£ 

50. 47 

0 

6. 

B39263 

*Bone morphogenetic protein & 513 
**** £5 standard deviations above mean 

128 

151 

50.01 

0 

7. 

041233 

Bone morphogenetic protein ho 455 
**** £1 standard deviations above mean 

63 

**** 

99 

25.93 

0 

8. 

0261 58 

Decapentaplegic protein precu 588 
**** 19 standard deviations above mean 

54 

**** 

89 

21.30 

0 

9. 

C37278 

Bone morphogenetic protein £B 408 
**** 18 standard deviations above mean 

36 

86 

19.91 

0 

10. 

C39364 

*6DF-1 embryonic growth facto 37£ 

28 

83 

18.52 

0 

11. 

029619 

Ugi embryonic growth factor - 360 
**#* 17 standard deviations above mean 

55 

82 

18.06 

0 

12. 

B37278 

Bone morphogenetic protein £ft 396 
***# 16 standard deviations above mean 

35 

#*#* 

81 

17.59 

0 

13. 

S16244 

Bone morphogenetic protein £ 398 
**** 14 standard deviations above mean 

35 

**** 

78 

16.21 

0 

14. 

030884 

Inhibin beta-ft chain precurso 4£6 

£0 

74 

14.35 

0 

15. 

B24248 

Inhibin beta-ft chain precurso 4£6 

£0 

74 

14.35 

0 

16. 

039364 

*GDF-1 embryonic growth facto 357 

£0 

74 

14.35 

0 

17. 

035683 

*GDF-1 embryonic growth facto 357 
**#* 13 standard deviations above mean 

£0 

*##* 

74 

14.35 

0 

IB. 

WFPGBO 

Inhibin beta-ft chain precurso 4£4 
**** l£ standard deviations above mean 

£0 

**** 

73 

13.89 

0 

19. 

B40905 

♦Inhibin beta-ft chain precurs 4£4 

£0 

71 

12.96 

0 

20. 

B40056 

*Inhibin beta-fl chain precurs 4£4 

£0 

71 

12.96 

0 
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1. US-07-800-364A-14 (1-281) 

S10529 *Osteogenic protein precursor - Human 


ENTRY 
TITLE 
DATE 

PLACEMENT 
COMMENT 
SOURCE 
REFERENCE 
#Authors 

#Journal 
#Title 

#Ref erence 
#Accession 
#Cross-ref 

SUMMARY 

SEQUENCE 


S10529 #Type Protein 
♦Osteogenic protein precursor - Human 
15-Jun-199£ #Sequence 15-Jun-199£ #Text 15-Jun-199£ 

0.0 0.0 0.0 0.0 0.0 

*This entry is not verified. 
Homo sapiens #Common-name man 

Oezkaynak E. , Rueger D. C. , Drier E.A. , Corbett C. $ 

Ridge R.J. , Sampath T.K. f Opperiaann H. 
EMBO J. (1990) 9: £085-2093 

OP-1 cDNA encodes an osteogenic protein in the 
TGF-beta family, 
-number S105£9 

S105£9 
erence EMBL:X51801 
#Molecular-weight 49313 #Length 431 #Checksutn 128 


Initial Score = 
Residue Identity = 
Gaps = 


132 Optimized Score = 158 Significance = 53.25 
56% Matches = 161 Mismatches = 115 

11 Conservative Substitutions = 0 


X 10 20 30 40 50 60 

EPHWKEFRFDLTQ I PAGE A VTAAEFR I YK- VPS I HLLNRTLH VSMFQ V VQEQSNRESDLFFL 
1 llllll 11 i I I 1 I I 1 1 I I i I I II I I I II I I I I I I 1 

VEHDKEFFHPRYHHREFRFDLSKIPE6EAMTAAEFRIYKDYIRERFDNETFRISVYQVLQEHLGRESDLFLL 
140 150 160 170 180 190 200 210 


70 80 90 100 110 120 130 

DLQTLRAGDEGWLVLDVTAASDCWLLKRHKDLGLRLYyETEDGHSVDPGLAGLLGQRAPRSQQPFVVTFFRA 
I III II I I I I I 1 I I II I I I M I I I I INI I I 111 1 II I 

DSRTLWASEEGWLyFDITATSNHWVyNPRHNLGLQLSMETLDGQSINPKLAGLIGRHGPQNKQPFMVAFFKA 
220 230 240 250 26© 270 280 


140 150 160 170 180 190 200 

SPSP I RTPRA V RPLRRRQ PKKSNELPQANRLP6IFDDVHGSHGRQVCRRHELYVSFQDLGWLDWVIA 

II II II III I I 1 I I I I I I I 1 I I I I 1 I 1 1 

TEVHFRSIRSTGSKGRSQNRSKTPKNGEALRMAN MAENSSSDQRQACKKHELYMSFRDLGWQDWI I A 

£90 300 310 320 330 340 350 


210 220 230 240 250 260 270 

PQGYSAYYCEGECSFPLDSCMNATNHAILQSLVHLMKPNAVPKACCAPTKLSATSVLYYDSSNNVILRKHRN 
I II I 1 I I I I I 1 III I I I I I I I I I 1 II 1 I 111 I I I 1 I 1 1 I I I I 1 I 11 I I I II 
PEGYAAYYCEGECAFPLNSYMNATNHA I VQTLVHFI NPET VPKPCCAPTQLNA I SVLYFDDSSN V I LKKYRN 

360 370 380 390 400 410 420 


X 
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MVVKACGCH 
IN MM! 
MVVRAC6CH 
430 


2. US-07-800-364A-14 (1-281) 

C39263 *Bone morphogenet ic protein 7 - Human 


ENTRY C39263 #Type Protein 

TITLE *Bone morphogenet ic protein 7 - Human 

DATE 04-Oct-1991 #Sequence 04-Oct-1991 #Text 04-Oct-1991 

PLACEMENT 0.0 0.0 0.0 0.0 0.0 

COMMENT *This entry is not verified. 

SOURCE Homo sapiens #Common-name man 

REFERENCE 

^Authors Celeste A. J. , Iannazzi J. A. 5 Taylor R. C. , Hewick 

R.M. , Rosen V., Wang E.A. , Wozney J.M. 
#Journal Proc. Natl. Acad. Sci. U.S.A. (1990) 87:9843-9847 

#Title Identification of transforming growth factor beta 

family members present in bone-inductive protein 
purified from bovine bone, 
-number A39263 

C39263 
erence GB:M38695 
#Molecular-weight 49313 #Length 431 #Checksum 128 


#Ref erence 
#Accession 
#Cross-ref 

SUMMARY 

SEQUENCE 


Initial Score = 
Residue Identity = 
Gaps = 


132 Optimized Score = 158 Significance = 53.25 
56* Matches = 161 Mismatches = 115 

11 Conservative Substitutions = 0 


X 10 20 30 40 50 60 

EPHWKEFRFDLTQIPAGEAMTAAEFRIYK-VPSIHLLNRTLHVSMFQVMQEQSNRESDLFFL 
I M M M M M M M M M M I II f MM M M M I 

VEHDKEFFHPRYHHREFRFDLSKIPEGEAVTAAEFRIYKDYIRERFDNETFRISUYQVLQEHLGRESDLFLL 
140 150 160 170 180 190 200 210 


70 80 90 100 110 120 130 

DLQTLRAGDEGWLVLDVTAASDCWLLKRHKDLGLRLYVETEDGHSVDPGLAGLLGQRAPRSQQPFy VTFFRA 
! Mi M M I 1 M 1 I 1 M I M I M I I M M I t M I I M I 

DSRTLWASEEGWLVFD I TATSNHWV VNPRHNLGLQLS VETLD6QS I NPKLAGL I GRHGPQNKQPFM VAFFKA 
220 230 240 250 260 270 280 


140 150 160 170 180 190 200 

SPSPIRTPRAVRPLRRRQ -PKKSNELPQANRLPGIFDDVHGSHGRQyCRRHELYVSFQDLGWLDWVI A 

II II M III I II I M II I II M M II I I 

TEVHFRSIRSTGSKQRSQNRSKTPKNQEALRMAN VAENSSSDQRQACKKHELYMSFRDLGWQDWI I A 

290 300 310 320 330 340 350 


210 220 230 240 250 £60 270 

PQGYSAYYCEGECSFPLDSCMNATNHAILQSLVHLMKPNAVPKACCAPTKLSATSVLYYDSSNNVILRKHRN 
I II I I I I I I I I III I I I I I I I I I I III I III I I I I I I I I I I I I I I I I I I II 
PEGYAAYYCEGECAFPLNSYMNATNHA I VQTLVHF I NPET VPKPCCAPTQLNA I S VLYFDDSSNV I LKKYRN 

360 370 380 390 400 410 420 

X 

MVVKAC6CH 
III I I I I I 
MVVRACGCH 
430 


1. US-07-800-364A-14 (1-281) 

BMP7_HUMAN BONE MORPHOGENETIC PROTEIN 7 PRECURSOR (BMP-7) (OS 

ID BMP7_HUMAN STANDARD; PRT; 431 AA. 

AC PI 8075; 

DT 01-NOV-1990 (REL. 16, CREATED) 

DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE) 

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE) 

DE BONE MORPHOGENETIC PROTEIN 7 PRECURSOR (BMP-7) (OSTEOGENIC PROTEIN 1) 

DE (OP1). 

6N BMP7 OR OP1. 

OS HOMO SAPIENS (HUMAN). 

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA; 

OC EUTHERIA; PRIMATES. 

RN 111 

RP SEQUENCE FROM N. A. , AND PARTIAL SEQUENCE. 

RC T I SSUE=PLACENTA ; 

RM 90291971 

RA OEZKAYNAK E. , RUE6ER D. C. , DRIER E. A. , CORBETT C. , RIDGE R. J. , 

RA SAMPATH T. K. , OPPERMANN H. ; 

RL EMBO J. 9:2085-2093(1990). 

RN [£] 

RP SEQUENCE FROM N. A. 

RM 91088608 

RA CELESTE A. J. , IANNAZZI J. A. , TAYLOR R. C. , HEWICK R. M. , ROSEN V., 

RA WANG E. A. , WOZNEY J. M. ; 

RL PROC. NATL. ACAD. SCI. U.S.A. 87:9843-9847(1990). 

CC -!- FUNCTION: INDUCES CARTILAGE AND BONE FORMATION. MAY BE THE 

CC OSTEOINDUCTIVE FACTOR RESPONSIBLE FOR THE PHENOMENON OF 

CC EPITHELIAL OSTEOGENESIS. 

CC -!- SUBUNIT: HOMODIMER, D I SULF I DE-LINKED. 

CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY. 

DR EMBL; X51801 ; HSOP1. 

DR EMBL; M&0316; HSTGFBD. 

DR PIR; S10529; S10529. 

DR PIR; C39263; C39263. 

DR PROSITE; PS00250; TGF_BETA. 
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SIGNAL; GROWTH FACTOR; CYTOKINE; BONE; CARTILAGE; GLYCOPROTEIN. 

CT 

r \ 

bluNHL 

1 

C7 

DHTCMT T Ol 
KU 1 fcIN 1 1HL. 

CT 

FT 

DPflDPD 


OQO 

DflTPMT TGI 
rU I CP* 1 IHLi 

r 1 

CHAIN 

293 

431 

BONE M0RPH06ENETIC PROTEIN 7. 

FT 

DISULFID 

3313 

396 

BY SIMILARITY. 

FT 

DISULFID 

359 

428 

BY SIMILARITY. 

FT 

DISULFID 

363 

430 

BY SIMILARITY. 

FT 

DISULFID 

395 

395 

INTERCHAIN (BY SIMILARITY). 

FT 

CARBOHYD 

187 

187 

POTENTIAL. 

FT 

CARBOHYD 

302 

302 

POTENTIAL. 

FT 

CARBOHYD 

321 

321 

POTENTIAL. 

SQ 

SEQUENCE 

431 AA; 

49313 

MW; 936534 CN; 


Initial Score = 132 
Residue Identity = S&tf 
Gaps = 1 1 


Optimized Score = 158 
Matches = 161 

Conservative Substitutions 


Significance = 54.74 
Mismatches = 115 

0 


X 10 20 30 40 50 60 

EPHWKEF RFDLTQ I PAGEAVTAAEFR I YK- VPS I HLLNRTLH VSMFQ V VQEQSNRESDLFFL 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

VEHDKEFFHPRYHHREFRFDLSKIPEGEAyTAAEFRIYKDYIRERFDNETFRISVYQVLQEHLGRESDLFLL 
140 150 160 170 180 190 200 210 


70 80 90 100 110 120 130 

DLQTLRA6DEGWLVLDVTAASDCWLLKRHKDLGLRLYVETEDGHSVDPGLAGLLGQRAPRSQQPFVVTFFRA 
I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

DSRTLWASEEGWL VFDI TATSNHWV VNPRHNLGLQLSVETLDGQS I NPKLAGL I GRHGPQNKQPFM V AFFKA 
220 230 240 250 260 270 280 

140 150 160 170 180 190 £00 

SPSP I RTPRAVRPLRRRQ PKKSNELPQANRLPG I FDDVHGSHGRQVCRRHELY VSFQDLGWLDW v I A 

II II II III I I I I I I I I I I I I I I I I I I I 

TEVHFRS I RSTGSKQRSQNRSKTPKNQEALRMAN VAENSSSDQRQACKKHELY VSFRDLGWQDW I I A 

£90 300 310 320 330 340 350 


210 220 230 240 250 260 £70 

PQ6YSAYYCEGECSFPLDSCMNATNHAILQSLVHLMKPNAVPKACCAPTKLSATSVLYYDSSNNVILRKHRN 
I II I I I I I I I I III I I I I I I I I I I III I III I I I I I I I I I I I I I I I I I I II 
PE6YAAYYCEGECAFPLNSYMNATNHAI VQTLVHFINPETVPKPCCAPTQLNAISVLYFDDSSNVILKKYRN 

360 370 380 390 400 410 420 


X 

MVVKACGCH 
III I I I I I 
MVVRACGCH 
430 


2. US-07-800-364A-14 < 1-281) 

BMP7 MOUSE BONE MORPHOGENETIC PROTEIN 7 PRECURSOR (BMP-7) (OS 


\ 


ID BMP7_M0USE STANDARD ; PRT; 430 Aft. 

AC P£3359 ; 

DT 01-NOV-1991 (REL. £0, CREATED) 

DT 01-NOV-1991 (REL. £0, LAST SEQUENCE UPDATE) 

DT 01-AUG-199£ (REL. £3, LAST ANNOTATION UPDATE) 

DE BONE MORPHOGENETIC PROTEIN 7 PRECURSOR (BMP-7) (OSTEOGENIC PROTEIN 1) 

DE (OP1). 

GN BMP7 OR OP1. 

OS MUS MUSCULUS (MOUSE). 

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA; 

OC EUTHERIA; RODENTIA. 

RN CI] 

RP SEQUENCE FROM N. A. 

RM 91354£37 

RA OEZKAYNAK E. , SCHNEGELSBERG P. N. J. , OPPERMANN H. 5 

RL BIOCHEM. BIOPHYS. RES. COMMUN. 179 : 1 16-1£3 ( 1991 ) . 

CC -!- FUNCTION: INDUCES CARTILAGE AND BONE FORMATION. MAY BE THE 

CC OSTEOINDUCTIVE FACTOR RESPONSIBLE FOR THE PHENOMENON OF 

CC EPITHELIAL OSTEOGENESIS. 

CC -!- SUBUNIT: HOMODIMER, D I SULF I DE-LINKED. 

CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY. 

DR EMBL; X56906; MMOP1G. 

DR PIR; JQ1184; JQ1184. 

DR PROSITE; PS00£50; TGF_BETA. 

KW SIGNAL; GROWTH FACTOR; CYTOKINE; BONE; CARTILAGE; GLYCOPROTEIN. 


FT 

SIGNAL 

1 

£9 

POTENTIAL. 

FT 

PROPEP 

30 

£91 

POTENTIAL. 

FT 

CHAIN 

£9£ 

430 

BONE MORPHOGENETIC PROTEIN 7, 

FT 

DISULFID 

3£9 

395 

BY SIMILARITY. 

FT 

DISULFID 

358 

4£7 

BY SIMILARITY. 

FT 

DISULFID 

3fc£ 

4£9 

BY SIMILARITY. 

FT 

DISULFID 

394 

394 

INTERCHAIN (BY SIMILARITY). 

FT 

CARBOHYD 

186 

18& 

POTENTIAL. 

FT 

CARBOHYD 

301 

301 

POTENTIAL. 

FT 

CARBOHYD 

3£0 

3£0 

POTENTIAL. 

FT 

CARBOHYD 

371 

371 

POTENTIAL. 

SQ 

SEQUENCE 

.430 AA; 

49283 

MW; 934349 CN; 


Initial Score = 13£ Optimized Score = 155 Significance = 53. 3£ 
Residue Identity = 55* Matches = 158 Mismatches = 118 

Gaps = 11 Conservative Substitutions = 0 

X 10 £0 30 40 50 60 

EPHWKEFRFDLTQIPAGEAVTAAEFRIYK-VPSIHLLNRTLHVSMFQVVQEQSNRESDLFFL 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

VEHDKEFFHPRYHHREFRFDLSKIPEGERVTAAEFRIYKDYIRERFDNETFQITVYQVLQEHSGRESDLFLL 
140 150 160 170 180 190 £00 210 

70 80 90 100 110 1£0 130 

DLQTLRAGDEGWLVLDVTAASDCWLLKRHKDLGLRLYVETEDGHSVDPGLAGLLGQRAPRSQQPFVVTFFRA 
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I I I 1 1 1 1 1 I 1 1 I I 1 1 1 I 1 1 1 1 1 I I 1 1 1 1 I I 1 1 1 I 1 1 I 

DSRTIWASEEGWLVFDITATSNHWVVNPRHNLGLQLSVETLDGQSINPKLAGLIGRHGPGNKQPFMVAFFKA 
££0 23© £40 £50 £60 £70 £B0 

140 150 160 170 180 190 £00 

SPSPIRTPRAVRPLRRRQ PKKSNELPQANRLPG I FDDVHGSHGRQ VCRRHELYVSFQDLGWLDW V I A 

II II II II I I I I I I I I I I I I I I I I I I I 

TEVHLRSIRSTGGKQRSQNRSKTPKNQEALRMA SVAENSSSDQRQACKKHELYVSFRDLGWQDWI IA 

£90 300 310 3£0 330 340 

£10 ££0 £30 £40 £50 £60 £70 

PQGYSAYYCEGECSFPLDSCMNATNHAILQSLVHLMKPNAVPKACCAPTKLSATSVLYYDSSNNVILRKHRN 
I II I I I I I I I I III I I I I I I I I I I III I III I I I I I I I I I I I I I I I I I I II 
PEGYAAYYCEGECAFPLNSYMNATNHAI VQTLVHFINPDTVPKPCCAPTQLNAISVLYFDDSSNVILKKYRN 
350 360 370 380 390 400 410 4£0 

X 

MVVKACGCH 
III I I I I I 
MVVRACGCH 
430 


3. US-07-800-364A-14 (1-£81) 

BMP5_HUMAN BONE MORPHOGENETIC PROTEIN 5 PRECURSOR (BMP-5). 

ID BMP5_HUMAN STANDARD; PRT; 454 AA. 

AC P££003; 

DT 01-AU6-1991 (REL. 19, CREATED) 

DT 01-AUG-1991 (REL. 19, LAST SEQUENCE UPDATE) 

DT 01-AU6-199£ (REL. £3, LAST ANNOTATION UPDATE) 

DE BONE MORPHOGENETIC PROTEIN 5 PRECURSOR (BMP-5). 

OS HOMO SAPIENS (HUMAN). 

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA; 

OC EUTHERIA; PRIMATES. 

RN CI 3 

RP SEQUENCE FROM N. A. 

RC TISSUE=BONE; 

RM 91088608 

RA CELESTE A. J. , IANNAZZI J. A. , TAYLOR R. C. , HEWICK R. M. , ROSEN V. , 

RA WANG E. A. , WOZNEY J. M. ; 

RL PROC. NATL. ACAD. SCI. U.S.A. 87:9843-9847(1990). 

CC -!- FUNCTION: INDUCES CARTILAGE AND BONE FORMATION. 

CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY. 

DR EMBL; M60314; HSTGFBB. 

DR PIR; A39£63; A39£63. 

DR PROSITE; PS00£50; TGF_BETA. 

KW SIGNAL; GROWTH FACTOR; CYTOKINE; BONE; CARTILAGE; GLYCOPROTEIN. 

FT SIGNAL 1 ? POTENTIAL. 

FT PROPEP ? 3££ POTENTIAL. 
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FT 

CHAIN 

323 

454 

BONE MORPHOGENETIC PROTEIN 5. 

FT 

DISULFID 

353 

419 

BY SIMILARITY. 

FT 

DISULFID 

382 

451 

BY SIMILARITY. 

FT 

DISULFID 

386 

453 

BY SIMILARITY. 

FT 

DISULFID 

418 

418 

INTERCHAIN (BY SIMILARITY). 

FT 

CARBOHYD 

211 

211 

POTENTIAL. 

FT 

CARBOHYD 

327 

327 

POTENTIAL. 

FT 

CARBOHYD 

345 

345 

POTENTIAL. 

FT 

CARBOHYD 

395 

395 

POTENTIAL. 

SO 

SEQUENCE 

454 AA; 

51736 

MM; 1034006 CN; 


Initial Score = 
Residue Identity = 
Gaps = 


111 Optimized Score = 153 
54* Matches = 155 

2 Conservative Substitutions 


Significance 
Mismatches 


= 52.36 


X Id 20 30 40 50 60 

EPHWKEFRFDLTQ IPAGEAVTAAEFR I YK VPS- 1 HLLNRTLH vSMFQV VQEQSNRESDLFFL 
I I I I I I I I I I II I I I I I I I I I I I I I I II I I I II I I I I 

VERDKDFSHQRRHYKEFRFDLTQIPHGEAvTAAEFRI YKDRSNNRFENET I K I S I YQ 1 1 KEYTNRDADLFLL 
170 X 180 190 200 210 220 230 


70 80 90 100 110 120 130 

DLQTLRAGDEGWLVLDVTAASDCWLLKRHKDLGLRLYVETEDGHSVDPGLAGLLGQRAPRSQQPFVVTFFRA 
I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I II I 

DTRKAQALDVGWLVFDITVTSNHWVINPQNNLGLQLCAETGDGRSINVKSAGLVGRDGPQSKQPFMVAFFKA 
240 250 260 270 280 290 300 

140 150 160 170 180 190 200 

SPSP I RTPRA V RPLRRRQPKKSNELPQANRLPG I FDD VH6SHGRQ VCRRHELY VSFQDLGWLDWV I APQGYS 
I I M II I I I I I I I I I I I I I I I I I I I I I I I 

SEvLLRSVRA-ANKRKNQNRNKSSSHQDSSRMSSvGDYNTSEQKQACKKHELYvSFRDLGWQDWIIAPEGYA 
310 320 330 340 350 360 370 

210 £20 230 240 250 £60 £70 

AYYCEGECSFPLDSCMNATNHAILQSLVHLMKPNAVPKACCAPTKLSATSVLYYDSSNNVILRKHRNMVVKA 
I II I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I 
AFYCD6ECSFPLNAHMNATNHAIUQTLVHLMFPDHVPKPCCAPTKLNAISVLYFDDSSNVILKKYRNMVVRS 
380 390 400 410 4£0 430 440 450 


280 
CGCH 
I I I I 
CGCH 
X 
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i/5/1 (Item 1 from file: 351) 

008644667 WPI Acc No: 91-148697/20 

XRAN Acc No: C91 -064306 

Osteogenic device for implantation in a mammal - comprising a matrix of 
treated collagen particles and a protein vhich induces bone formation 

Patent Assignee: (CREA-) CREATIVE BIQMOLECULES INC; (CREA-) CREATIVE 
BIOMQLECUL 

Author (Inventor): QPPERMAN H; KUBERASAMP T; RUEGER D C; OZKAYNAK E; PANG R 
H L 

Number of Patents: 005 
Number of Countries: 017 


Patent Family: 




CC Number 

Kind 

Date 

Week 

WO 9105802 

A 

910502 

9120 

CA 2027259 

A 

910418 

9126 

Al) 9066481 

A 

910615 

9133 

EP 448704 

A 

911002 

9140 

JP 4502336 

W 

920423 

9223 


Priority Data <CC No Date): US 422699 (891017); US 483913 (900222); US 
569920 (900820) 

Applications (CC, No, Date) : JP 90515578 (901015); WO 90US5903 (901015); EP 

90916655 (901015) 
Language: English 

EP and/or WO Cited Patents: EP 128041; EP 148155; EP 169001; EP 169016; EP 
182483; EP 212474; EP 230647; EP 309241; US 4172128; US 4294753; US 
4394370; US 4434094; US 4563350; US 4563489; US 4657548; US 4703108; US 
4725671; US 4789663; US 4812120; US 4824939; US 4837285; US 4894441; WO 
8600526; WO 8800205; WO 8909605; WO 8910409; HJnl.Ref 

Designated States 
(National): AU; CA; JP 

(Regional): AT; BE; CH; DE; DK; ES; FR; GB; GR; IT; LU; NL; SE; LI 

Filing Details: JP04502336 Based on WO 9105802 

Abstract (Basic): WO 9105802 

An osteogenic device for implantation in a mammal comprises (a) a 
biocompatible, in vivo biodegradable matrix of mineral-free, 
delipidated Type I insol. bone collagen particles, depleted in 
noncollagenous protein and (b) a protein (I) produced by the expression 
of recombinant DNA in a mammalian cell, (I) comprising 2 oxidised 
subunlts, the amino acid sequence of each subunit being sufficiently 
duplicative of the amino acid sequence shown (0P1-16V) such that the 
dimeric species comprising the subunits has a conformation that is 
capable of inducing endochondral bone formation in a mammal vhen 
disposed vithin the matrix and implanted in the mammal. 

An osteogenic protein expressed from recombinant DNA in a 
mammalian host cell and capable of inducing endochondral bone formation 
in a mammal vhen disposed vithin a matrix implanted in the mammal, 
comprises (I). Also claimed are a biocompatible, in vivo biodegradable 
matrix for implantation in a mammal, an osteogenic protein expressed 

from recombinant DNA in a mammalian host cell, end a biocompatible, in 
vivo biodegradable matrix for implantation. 

USE/ADVANTAGE - The collagen particles are treated to increase 
the intraparticle porosity and the surface area of the particles. The 
matrix obtd. may be combined with osteogenic protein to induce 
endochondral bone formation reliably and reproducibly in a mammalian 

hody. The osteogenic deviGe can induce at the locus of the implant the 
full developmental cascade of endochondral bone formation including 
vascularisation, mineralisation and bone marrov differentiation. (I) 


dependent. @<105pp Dug. No. 0/13)9 
' File Segment: CPI 

?nT*Pat Ellis: A?Ift-81?^02; A61K-037/12; C07K-015/06; C07K-017/02; 

C09H-001/02} C12N-015/00 
Chemical Fragment Codes <H1): 

•01* H423 M424 M710 H740 H903 N135 P421 P714 Q233 V752 
Chemical Fragment Codes ( MG ) : 

•02* M903 P421 P714 Q233 R220 

? 


1. US-07-800-364A-7 (1-199) 
Q15£4£ BMP-8 peptide. 


ID Q15£4£ standard; DNA; 199 BP. 

AC Q15£4£; 

DT 09-MAR-199E (first entry) 

DE BMP-8 peptide. 

KW Cartilage; wound healing; tissue repair; BMP; ss. 

OS Bos taurus. 

FH Key Location/Qualifiers 

FT CDS 30. . 176 

FT /*tag= a 

FT /label= BMP-8 

PN WO9118098-A. 

PD 28-N0V-1991. 

PF 15-MAY-1991; U0338B. 

PR 16-MAY-1990; US-525357. 

PR 15-JAN-1991; US-641E04. 

PA (GENE-) GENETICS INST INC. 

PI Hewick RM, Wang JH; 

DR WPI; 91-369£5£/50. 

DR P-PSDB; R155££. 

PT New BMP-8 protein - useful in inducing cartilage and/or bone 

PT foraiation to treat wounds and repair fractures and tissues, e.g. 

PT burns, incisions and ulcers 

PS Disclo sure; Page 27$ 50pp; English. 

CC Nucleotide 95 is a "T" whereas in the amplified DNft fragment the 

CC corresp. nucleotide is "C M . 

CC This sequence encodes amino acids 64-112 (C-terminal) of the BMP-8 

CC sequence given in R15522- BMP-8 peptide fragments were purified from 

CC ground bovine bone powder. Oligonucleotide probes were designed based 

CC on these peptides and used to screen bovine genomic libraries to 

CC isolate DNfi sequences encoding BMP-8 proteins. 

CC Pharmaceutical compsns. contg. BMP-8 can be used to aid bone and/or 

CC cartilage formation or wound healing and tissue repair. The 

CC proteins are not very species specific so can be used in domestic 

CC and farm animals as well as humans. 

CC See also Q15240-48, R15517 and R15522. 

SQ Sequence 199 BP; 34 A; 87 C; 52 G; 26 T; 


Initial Score = 199 Optimized Score = 199 Significance = 14.46 
Residue Identity = im% Matches = 199 Mismatches = ® 

Gaps = © Conservative Substitutions = 0 

X 10 20 30 40 50 60 70 

TGCCCGCTGCCCCCTCCCGCCCCCGCCftGGTGCftCCTGCTGflAGCCGCftCGCGGTCCCCftflGGCGTGCTGCG 
I I I I I I I I I 11 I I I ! 1 1 I 1 I I I M I I I I I I M I I M I I I I I i I I ! ! I I I I I I 11 I I I I I I I I 1! I I M I M I 
TGCCCGCTGCCCCCTCCCGCCCCCGCCftGGTGCACCTGCTGfiflGCCGCACGCGGTCCCCflflGGCGTGCTGCG 
X 10 20 30 40 50 60 70 


-1» 


80 90 100 110 120 130 140 

CGCCCflCCfiftGCTGftGCGCCPCTTCCGTGCTCTflCTACGftCflGCflGCftfiCfiflCGTCftTCCTGCGCftfiGCftCC 
1 I I I I i I I I II I ! M I I ! M 1 I I i I I I i I I I I I I M I I I I I i I II I II I I II I I ! I I i I I I i M I I I I I I M 
CGCCCACCfiftGCTGAGCGCCftC^ 

80 90 100 110 120 130 140 


150 160 170 180 190 X 

GCAftCftTGGTGGTCCGCGCCTGCGGCTGCCACTGAGGCCCCftftCTCCACCGGCftG 
I I II I I I I I II I I I II II M I I I I I I I M M II I I I M I I M M II I I I i I i II I 
GCflftCATGGTGGTCCGCGCCTGCGGCTGCCACTGfiGGCCCCftftCTCCftCCGGCftG 

150 160 170 180 190 X 


. U5-07-800-364A-7 (1-199) 

DRO60APRO D. raelanogaster 60A protein (60A) mRNA, complete cd 

LOCUS DRO60APRO 1632 bp ss-mRNA INV il-SEP-1992 

DEFINITION D. raelanogaster 60A protein (60A) mRNA, complete cds. 
ACCESSION M84795 

KEYWORDS 60A protein; bone roorphogenet ic protein; 

transforming growth factor. 
SOURCE Drosophila melanogaster (library: Kopczynski-Muskavitch) embryo 

cDNA to ibRNA. 
ORGANISM Drosophila melanogaster 

Eukaryota; Animal ia; Metazoa; Arthropoda; Uniramia; Insecta; 

Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 

Cyclorrhapha; Schizophora; Drosophi loidea; Drosophi 1 idae, 
REFERENCE 1 (bases 1 to 1632) 

AUTHORS Doctor, J. S. , Jackson, P. D. , Rashka, K. E. , Visalli,M. and 

Hoffmann, F. M. 

TITLE Sequence, biocheaiical characterization and developmental 
expression 

of a new member of the TGF-beta superfamily in Drosophila 
melanogaster 
JOURNAL Dev. Biol. 151, 491-505 (1992) 
STANDARD full automatic 
FEATURES Locat ion/Qual if iers 

CDS 142. . 1509 

/gene="60A" 

/product= n 60A protein 1 ' 
/codon start=l 


/ 1 ran s 1 at i on= M MSGLRNTSEAM A VLASLGLGM VLLMFM ATTPPAVEATGSG I Y I D 
N6KDQT I MHR VLSEDDKLDMSYE I LEFLGI AERPTHLSSHQLSLRKSAPKFLLDV YHR 
I TAEEGLSDQDEDDDYERGHRSRRSADLEEDEGEQQKNF I TDLDKRA I DESD 1 1 MTFL 


NKRHHNMDELRHEHGRRLWFDVSNVPNDNYLVMAELR1YGNANEGKWLTANREFTITV 


YfllGTGTLGQHTMEPLSSVNTTGDYVGWLELNVTEGLHEWLVKSKDNHGIYIGflHOVN 


RPDRE VKLDD I 6L I HRK VDDEFQPFM I GFF RGPEL I K AT AHSSHH RSK RS ASHPRK RK 


KS V SPNN V PLLEPMEST RSCQMQTLY I DFKDLGWHDW 1 1 APEGYGAFYCSGECNFPLN 


RHMNfiTNHftl VQTLVHLLEPKKVPKPCCftPTRLGflLPVLYHLNDENVNLKKYRNMI VK 

SCGCH" 

BASE COUNT 414 a 479 c 453 g £86 t 

ORIGIN chromosome 2R — 60A 


Initial Score = 104 Optimized Score = 123 Significance = 6.66 
Residue Identity = 63% Matches = 134 Mismatches = &£ 

Gaps = 16 Conservative Substitutions = 8 

GACTTCAA6GATCTGGGCTGGCATGACTGGATCATCGCACCA6AGGGCTATGGCGCCTTCTACTGCAGCGGC 
1230 1240 1250 1260 1270 1280 1290 

X 10 20 30 

TGCCC6C — T6CCCCCTCC CG-CCCCCGCCAGGTGCACCTG 

II III I I I I I I I I I I I I I I I I I I I I 
GAGTGCAATTTCCCGCTCAATGC6CACAT6AACGCCACGAACCAT6C6ATCGTCCA6ACCCT6GTCCACCTG 
1300 1310 1320 X 1330 1340 1350 1360 


40 50 60 70 80 90 100 

CTGAAGCCGCACGCGGTCCCCAAGGCGTGCTGCGCGCCCACCAAGCTGAGCGCCACT-TCCGTGCTCTACTA 
III I I I I I I III I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II III I 
CTGGAGCCGAAGAAGGTGCCCAAGCCCTGCTGCGCTCCGACCAGGCT6GGAG-CACTACCCGTTCTGTACCA 
1370 1380 1390 1400 1410 1420 1430 


110 120 130 140 150 160 170 180 

CGACAGCAGCAACAACGTCATCCTGC6CAAGCACCGCAACATGGTGGTCCGCGCCTGCGGCTGCCACTGA6G 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CCTGAACGACGAGAATGT6AACCTGAAAAAGTATAGAAACATGATT6TGAAATCCTGCGGGTGCCATTGA— 
1440 1450 1460 1470 1480 1490 1500 


190 X 
CCCCAACT — CCflC — CGGC — AG 
I I I I I I I I III II 
ATTCAATTTACCACTTAGGCTAAGCACTATAGCACTATATATAAAGCATTTCTGT6TAGATGTCCATCTCT6 
1510 1520 1530 X 1540 1550 1560 1570 1580 


TACTAATCTCGAATATTTTTATACAAAAATATACAGCGCTGTACCTAACTG 
1590 1600 1610 1620 1630 


1. US-07-800-364A-7 < 1-199) 

QQBE8 BPLF1 protein - Human herpesvirus 4 (strain B95-8) 


ENTRY 
TITLE 
DATE 

PLACEMENT 
SOURCE 
ACCESSION 
REFERENCE 
#Authors 


QQBE8 #Type Protein 

BPLF1 protein - Human herpesvirus 4 (strain B95-8) 
£5-Feb-1985 #Sequence £5-Feb-1985 #Text 31-Mar-1992 
1934.0 1.0 1.0 1.0 1.0 
human herpesvirus 4, Epstein-Barr virus 
A03747 

(Protein coding region) 

Baer R. , Bankier A. T. , Biggin M. D. , Deininger P. L. , 
Farrell P.J., Gibson T. J. , Hatfull G. , Hudson 
G. S. , Satchwell S. C. , Seguin C. , Tuffnell P. S. , 
Barrell B.G. 
#Journal Nature (1984) 310:£07-£11 
#Title DNA sequence and expression of the B95-8 

Epstein-Barr virus genome. 
#Ref erence-number A03794 
COMMENT The sequence was translated from the DNA sequence, 

which was obtained from GenBank, release £6.0. 
SUMMARY #Molecular-weight 337955 #Length 3149 #Checksum 7480 
SEQUENCE 


Initial Score = 
Residue Identity = 
Gaps = 
Translation Frame= 


£0 Optimized Score = £4 

37* Matches = £9 

14 Conservative Substitutions 
1 


Significance = 5.31 
Mismatches = 35 


AHVRVSTYAHDILQYVGAPGAQYTCVHLYFLPEAFETEDPRIFMLEHYGVYDFYEANGSGFDLVGPELVSSD 
£10 ££0 £30 £40 £5© £60 £70 

X 10 £0 30 

CPLP PPAPAR CTCXSRTRSPRRAARPPSXAPLP 

III Mil I I I I I I I I I 

GEAAGTPGADSSPPVMLPFERRIIPYNLRPLPSRSFTSDSFPA-ARYSPAKTNSPPSSPASAA-PASAAPAS 
£80 290 300 X 310 3£0 330 340 

40 50 60 X 

CSTTTAATTS-SCASTATWWSAPAAATEAPTPPA 

II I III I I I I I II 
AAPASAAPASAAPASAAPASAAPASAAPASSPPLF I P I PGLGHTPG VPAPSTPPRASSGAAPQTPKRKKGLG 
350 360 370 380 390 400 410 


KDSPHKKPTSGRRLPLSSTTDTEDDQLPRTHVPPHRPPSAARLPPPVIPIPHQSPPASPTPH 
4£0 430 440 450 460 470 480 


1. US-07-800-364A-7 (1-199) 

TEGU EBV LARGE TEGUMENT PROTEIN. 


-4- 


ID TEGU_EBV STANDARD; PRT; 3149 AA. 

AC P03186; 

DT El-JUL-1986 (REL. 01, CREATED) 

DT £1 -JUL- 1986 (REL. 01, LOST SEQUENCE UPDATE) 

DT 01 -MAY- 1992 (REL. 22, LAST ANNOTATION UPDATE) 

DE LARGE TEGUMENT PROTEIN. 

GN BPLF1. 

OS EPSTEIN-BARR VIRUS (STRAIN B95-8). 

OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; HERPESVIRIDAE. 

RN m 

RP SEQUENCE FROM N. A. 

RM 84270667 

RA BAER R. , BANKIER A.T., BIGGIN M. D. , DEININGER P. L. , FARRELL P. J., 

RA GIBSON T.J. , HATFULL G. , HUDSON G.S. , SATCHWELL S.C., SEGUIN C. , 

RA TUFFNELL P. S. , BARRELL B. G. ; 

RL NATURE 310:207-211(1984). 

CC -!- FUNCTION: TEGUMENT PROTEIN. 

CC -!- SIMILARITY: BELONGS TO FAMILY THAT GROUPS TOGETHER HSV-1 UL36, 
CC EBV BPLF1, AND VZV £2. 

DR EMBL; V01555; EBV. 

DR PIR; A03747; QQBE8. 

SQ SEQUENCE 3149 AA; 337954 MW; 1.65O784E+07 CN; 

Initial Score = £0 Optimized Score = 24 Significance = 5.70 

Residue Identity = 375C Matches = 29 Mismatches = 35 

Gaps = 14 Conservative Substitutions = 0 

Translation Frarae= 1 

AHVRVSTYAHDILQYVGAP6AQYTCVHLYFLPEAFETEDPRIFMLEHYGVYDFYEANGSGFDLVGPELVSSD 
210 2£0 £30 £40 £50 £60 270 

X 10 20 30 

CPLP PPAPAR CTCXSRTRSPRRAARPPSXAPLP 

III I I I I I I I I I I I I I 

GEAAGTPGADSSPP VMLPFERR 1 1 PYNL RPLPSRSFTSDSFPA-ARYSPAKTNSPPSSPASAA-PASAAPAS 
£80 290 300 X 310 320 330 340 

40 50 60 X 

CSTTTAATTS-SCASTATWWSAPAAATEAPTPPA 

II I III I I I I I II 
AAPASAAPASAAPASAAPASAAPASAAPASSPPLF I P I PGLGHTPGVPAPSTPPRASSGAAPQTPKRKK6LG 
350 360 370 380 390 400 410 

KDSPHKKPTSGRRLPLSSTTDTEDDQLPRTHVPPHRPPSAARLPPP V I PI PHQSPPASPTPH 
4£0 430 440 450 460 470 480 


£. US-07-8O0-364A-7 (1-199) 

Y115 ADE0£ HYPOTHETICAL PROTEIN E-115. 


ID 
AC 
DT 
DT 
DT 
DE 
OS 
OC 
RN 
RP 
RM 
RP 
RA 
RL 
DR 
DR 
KW 
SQ 


Yll 


ADE02 


STANDARD ; 


PRT; 


115 AA. 


P03290; 


21-JUL-1986 
21-JUL-1986 
01-APR-1990 


(REL. 01, CREATED) 
(REL. 01, LOST SEQUENCE UPDATE) 
(REL. 14, LAST ANNOTATION UPDATE) 
HYPOTHETICAL PROTEIN E-115. 
HUMAN ADENOVIRUS TYPE 2. 

VIRIDAE; DS-DNA NONENVELOPED VIRUSES; ADENOVIRIDAE 
CI] 

SEQUENCE FROM N. A. 
83056843 

6INGERAS T. R. , SCIAKY D. , GELINAS R. E. , 
KELLY M.M. , BULLOCK P. A., PARSONS B. L. , 
J. BIOL. CHEM. 257:13475-13491(1982). 
EMBL; J01917; AD2. 
PIR; A03862; A03862. 
HYPOTHETICAL PROTEIN. 

SEQUENCE 115 AA? 12236 MW; 68893 CN; 


BING-D0N6 J., YEN C. E. , 
O'NEILL K.E. , ROBERTS R. J. ; 


Initial Score = 
Residue Identity = 
Gaps = 
Translation Frame= 


12 Optimized Score = 23 

295C Matches = 26 

21 Conservative Substitutions 
1 


Significance = 
Mismatches = 


5. 18 

40 

0 


X 10 £0 30 40 

CPLPPPAPARCTCXSRTRS PRRAARPPSXAPLPCSTTTAAT 

III III I I I I I II 

MGETWFLTPNGQSSPGSWNARPSAGPAARMPTPRNRYFSRPSSTPLKVCTAPRAAPPPRASCAPRATPRRGW 
10 20 30 40 50 60 70 


50 60 X 

TSSCASTATW WSAPA-AATEAPTPPA 

I I I I I I I I I I I I I 

TMTPWSNATWPTRRAKTGSASAPAGPASSAPWPARSPERTPSP 


1. US-07-800-364A-9 (1-172) 

A33532 *Mucin - Human (fragment) 


ENTRY A33532 #Type Protein (fragment) 

TITLE *Mucin - Human (fragment) 

DATE 04-Sep-1992 #Sequence 04-Sep-1992 #Text 04-Sep-199£ 

PLACEMENT 0.0 0.0 0.0 0.0 0.0 

COMMENT *This entry is not verified. 

SOURCE Homo sapiens ^Common-name man 

REFERENCE 

#Authors Gum J. R. , Byrd J. C. , Hicks J. W. , Toribara N. W. , 

Lamport D. T. A. , Kim Y.S. 

#Journal J. Biol. Chem. (1989) 264:6480-6487 


-6- 


#Title Molecular cloning of human intestinal mucin cDNAs. 

Sequence analysis and evidence for genetic 

polymorphism. 
#Ref erence-number A33532 
Accession A33532 

#Cross-reference GB:M22404; GB:M22405; GB:M22406 
SUMMARY #Length 519 #Checksum 4853 

SEQUENCE 


Initial Score = 
Residue Identity = 
Gaps = 
Translation Frame= 


11 Optimized Score = 23 Significance = 7-36 

33% Matches = 26 Mismatches = 31 

20 Conservative Substitutions = 0 
1 


1. US-07-800-364A-11 (1-119) 

S16244 Bone morphogenet ic protein 2 precursor - African 

ENTRY S16244 #Type Protein 

TITLE Bone morphogenet ic protein 2 precursor - African 

clawed frog 

DATE 30-Jun-1992 #Sequence 30-Jun-1992 #Text 31-Sep-1992 

PLACEMENT 0.0 0.0 0.0 0.0 0.0 

SOURCE Xenopus laevis #Common-name African clawed frog 

ACCESSION S16244 

REFERENCE 

#Authors Plessow S. , Koester M. , Knoechel W. 

#Journal Biochim. Biophys. Acta (1991) 1089:280-282 

#Title cDNA sequence of Xenopus laevis bone morphogenet ic 

protein 2 (BMP-2). 

#Ref erence-number S16244 

#Accession S16244 

#Molecule-type roRNA 

#Residues 1-398 <PLE> 

#Cross-reference EMBL:X5503i 
KEYWORDS dimer\ glycoprotein 

FEATURE 

285-398 #Protein bone morphogenet ic protein 2 

(predicted) (MAT) \ 
137,202,340 #Binding-site carbohydrate (Asn) 

(co va lent ) (predicted) 
SUMMARY #Molecular-weight 45575 #Length 398 #Checksum 9714 

SEQUENCE 

Initial Score = 11 Optimized Score = 15 Significance = 6.02 

Residue Identity = 32% Matches = 18 Mismatches = 21 

Gaps = 16 Conservative Substitutions = 0 

Translation Frame= 1 


-7- 


SDSSKLHR I N I YD I VKPAAAASRGP V V RLLDTRL VHHNESKWESFD VTPA IfiRWI AHKQPNHGF VVEVNHLD 
170 180 190 200 210 220 230 

X 10 20 

PLRVPADD— VHGSHGRQ VCRRHELYVSFQ 

I I I I I I I I I I I I I I 

NDKNVPKKHVRISRSLTPDKDNWPQIRPLLVTFSHDGKGHflLHKRQKRQARHKQRKRLKSSCRRHPLYVDFS 
240 250 260 X 270 280 290 300 

30 X 
DL6WLVSSDSP 
III I 

DV6WNDWI VfiPPGYHflFYCHGECPFPLflDHLNSTNHftI VQTLVNSVNTNIPKflCCVPTELSfllSMLYLDENE 
310 320 330 340 35© 360 370 380 

K V VLKNYQDMV VEGCGCR 
390 


US-07-800-364fl-ll < 1-1 19) 

B39263 *Bone raorphogenetic protein 6 - Human 


ENTRY 
TITLE 
DATE 

PLACEMENT 
COMMENT 
SOURCE 
REFERENCE 
ttAuthors 


#Journal 
#Title 


B39263 #Type Protein 

*Bone raorphogenetic protein 6 - Hunan 

04-Oct-1991 ^Sequence 04-Oct-1991 #Text 04-Oct-1991 

0.0 0.0 0.0 0.0 0.0 

*This entry is not verified. 
Homo sapiens ttCommon-name nan 


Celeste 
R.M. , 


A.J. 


Rosen 


Iannazzi 
V., Wang 


A. 


, A. 


Taylor R. C. 
Wozney J.M. 


Hewick 


J. 
E. 

Proc. Natl. Acad. Sci. U.S.A. (1990) 87:9843-9847 
Identification of transforming growth factor beta 
family members present in bone-inductive protein 
purified from bovine bone. 
#Ref erence-nunber A39263 
#Accession B39263 
ttCross-reference GB:M38694 
SUMMARY #Molecular-weight 57225 #Length 513 ^Checksum 183 

SEQUENCE 


Initial Score = 
Residue Identity = 
Gaps = 
Translation Frame= 


14 Optimized Score = 19 Significance = 10.04 

50% Matches = 20 Mismatches = 19 

1 Conservative Substitutions = 0 
1 


DTRVVWASEEGWLEFDITATSNLWVvTPDHNMGLQLSVVTRDGVHVHPRAAGLVGRDGPYDKQPFMVAFFKV 
300 310 320 330 340 350 360 


10 


?0 


30 


-8- 


PLRV-PADDVHGSHGRQVCRRHELYVSFGDLGWLVSSDSP 
It I I I I I t 11 I II I I i i f I 1 

SEyHVRTTRSftSSRRRQQSRNRSTQSQDVflRVSSftSDYNSSELKTftCRKHELYVSFQDLGWQDWI IftPKGYft 
370 380 390 X 400 410 420 430 X 

ANYCDGECSFPLNflHMNATNHftlVQ 

440 450 460 470 480 490 500 

CGCH 
510 


3, US-07-800-364A-11 (1-1 19) 

JQ1184 Osteogenic protein 1 precursor - Mouse 

ENTRY JQ1184 #Type Protein 

TITLE Osteogenic protein 1 precursor - Mouse 

DATE 31-Dec-1991 #Sequence 31-Dec-199i 4TText 31-Sep-1992 

PLACEMENT 0.0 0-0 0,0 0.0 0.0 

SOURCE Mus musculus ttCommon-name house mouse 

ACCESSION JQ1184N PQ0224 

REFERENCE 

#Authors Oezkaynak E. , Schnegelsberg P. N. J. , Gppermann H. 

#Journal Biochem. Biophys. Res. Commun. (1991) 179:116-123 

#Title Murine osteogenic protein (GP-1): high levels of 

mRNA in kidney. 

#Ref erence-number JQ1184 

#Accession JQ1184 

#Molecule-type DNA 

#Residues 1-120 <OEZ> 

#Cross-ref erence EMBL:X56906 

# Accession PQ0224 

#Molecule-type mRNA 

#Residues 54-430 <OE2> 
COMMENT This protein induces bone format ion. 

FEATURE 

1-29 #Domain signal sequence <SIG>\ 

30-290 #Peptide propeptide (predicted) <PRO>\ 

291-430 tfProtein osteogenic protein-1 

(experimental ) (MAT) \ 
186,301,320,371 #Binding-site carbohydrate (Asn) 

(co va lent ) (predicted) 
SUMMARY #Molecular-weight 49283 #Length 43© ^Checksum 9190 

SEQUENCE 

Initial Score = 14 Optimized Score = 17 Significance = 8.03 
Residue Identity = 45* Matches = 18 Mismatches = 21 

Gaps = 1 Conservative Substitutions = 0 

Translation Frame= 1 


-9- 


DSRT I WASEEGWL VFDI TATSNHWV VNPRHNLGLQLSVETLDGQSI NPKLAGL I GRHGPQNKQPFMVAFFKA 
££0 230 £40 £50 £60 £70 £80 


X 10 £0 30 X 

PLR-VPADDVHGSHGRQVCRRHELYVSFQDLGWLVSSDSP 
II I III I I I I I I I I I I I I 

TEWLRSIRSTGGKQRSQNRSKTPKNQEALRMASVAENSSSDQRQACKKHELYVSFRDLGWQDWIIAPEGYA 
£90 300 310 3£0 330 340 350 

AYYCEGECAFPLNSYMNATNHA IVQTLVHFI NPDT VPKPCCAPTGLNA I S VL YFDDSSN V I LKKYRNMV V RA 
3&0 370 380 390 400 410 4£0 

CGCH 
430 


4. US-07-800-3&4A-11 (1-119) 

C39263 *Bone morphogenet ic protein 7 - Human 


ENTRY 
TITLE 
DATE 

PLACEMENT 
COMMENT 
SOURCE 
REFERENCE 
ttAuthors 

# Journal 
#Title 


C392&3 #Type Protein 

♦Bone niorphogenetic protein 7 - Human 

04-Oct-1991 #Sequence 04-Oct-1991 ttText 04-Oct-1991 

0.0 0.0 0.0 0.0 0.0 
*This entry is not verified. 
Homo sapiens ttCommon-name man 

Celeste A. J. , Iannaz2i J. A. , Taylor R. C. , Hewick 


R.M. 


Rosen V. , Wang E. A. , Wozney J. M. 


ttReference 
^Accession 
#Cross-ref 

SUMMARY 

SEQUENCE 


Proc. Natl. Acad. Sci. U.S.A. (1990) 87:9843-9847 
Identification of transforming growth factor beta 
family members present in bone-inductive protein 
purified from bovine bone, 
-number A392&3 

C39263 
erence GB:M38695 
#Molecular-weight 49313 #Length 431 #Checksum 128 


Initial Score = 14 Optimized Score = 17 Significance = 8.03 

Residue Identity = 45% Matches = 18 Mismatches = 21 

Gaps = 1 Conservative Substitutions = 0 

Translation Frame= 1 

DSRTLWASEEGWLVFDITATSNHWVVNPRHNL6LQLSVETLDGQSINPKLAGL I GRHGPQNKQPFMVAFFKA 
££0 £30 £40 £50 £&0 £70 £80 


X 10 £0 30 X 

PLR-VPADDVHGSHGRQVCRRHELYVSFQDLGWLVSSDSP 
II I III I I I I I I I I I I I I 

TEVHFRSIRSTGSKQRSQNRSKTPKNQEALRMANVAENSSSDQRQACKKHELYVSFRDLGWQDWIIAPEGYA 


-10- 


290 308 310 X 320 330 340 350 

ftYYCEGECflFPLNSYMNftTNHfllVQTLVHFINPETVPKPCCftPTQLNAISyLYFDDSSNy ILKKYRNMyVRfi 
360 370 380 390 400 410 420 

CGCH 
430 


5- US-07-800-364A-11 < 1-1 19) 

S10529 ^Osteogenic protein precursor - Human 

ENTRY S10529 #Type Protein 

TITLE ^Osteogenic protein precursor - Human 

DATE 15-Jun-1992 #Sequence 15-Jun-1992 #Text 15-Jun-1992 

PLACEMENT 0.0 0.0 0.0 0.0 0.0 

COMMENT *This entry is not verified. 

SOURCE Homo sapiens #Comnion-naine man 

REFERENCE 

#Authors Oezkaynak E. , Rueger D. C. , Drier E. ft. , Corbett C. , 

Ridge R.J. , Sampath T.K. , Oppermann H. 
#Journal EMBO J. (1990) 9:2085-2093 

#Title OP-1 cDNft encodes an osteogenic protein in the 

TGF-beta family. 

#Reference-nuraber S10529 

#ftccession SI 0529 

#Cross-reference EMBL:X51801 
SUMMARY #Molecular-weight 49313 #Length 431 #Checksum 128 

SEQUENCE 

Initial Score = 14 Optimized Score = 17 Significance = 8.03 
Residue Identity = 45'/ Matches = 18 Mismatches = 21 

Gaps = 1 Conservative Substitutions = 0 

Translation Frame= 1 

DSRTLWflSEEBWLVFDITflTSNHWVVNPRHNLGLQLSWETLDGDSINPKLflGLIGRHBPQNKQPFMVRFFKfl 

220 230 240 250 260 270 280 

X 10 20 30 X 

PLR-VPADDVHGSHGRQVCRRHELYVSFQDLGWLVSSDSP 
11 I 111 I t I I I I I It I I I 

TEVHFRSIRSTGSKQRSQNRSKTPKNQEALRMANMAENSSSDQRQACKKHELYVSFRDLGWQDWI IAPE6YA 
290 300 310 X 320 330 340 350 

AYYCEGECAFPLNSYMNATNHAIVQTLMHFINPETVPKPCCAPTQLNAISMLYFDDSSNVILKKYRNMVVRA 
360 370 380 390 400 410 420 

CGCH 
430 


-11- 


6. US-07-800-36 40-11 (1-119) 

A339£5 *Vg-l -related protein precursor - Mouse 


ENTRY 
TITLE 
DATE 

PLACEMENT 
COMMENT 
SOURCE 
REFERENCE 
^Authors 

ttJournal 
#Title 


#Ref erence 
Accession 
ttCross-ref 

SUMMARY 

SEQUENCE 


#Type Protein 
*Vg-l-related protein precursor - Mouse 
19-Sep-199£ #Sequence 19-Sep-1992 #Text 19-Sep-199£ 

0.0 0.0 0.0 0.0 0.0 

*This entry is not verified. 

Mus musculus #Comroon-name house mouse 

Lyons K. , Graycar J. L. , Lee A., Hashmi S. , Lindquist 

P.B. , Chen E.Y. , Hogan B.L. M. , Derynck R. 
Proc. Natl. Acad. Sci. U.S.A. <1989) 86:4554-4558 
Vg» — 1, a mammalian gene related to Xenopus Vg-l,'is 
a member of the transforming growth factor 
beta-gene superf ami ly. 
number A33925 
A339S5 
erence GB:J04566 
#Molecular-weight 48788 #Length 438 #Checksum 9769 


Initial Score = 15 Optimized Score = 17 Significance = 8.03 

Residue Identity = 43% Matches = 17 Mismatches = ££ 

Gaps = 0 Conservative Substitutions = 0 

Translation Frame= 1 

TRVVWASEEGWLEFDITATSNLWVVTPQHNMGLQLSy VTRDGLHVNPRAAGLVGRDGPYDKQPFMyAFFKVS 
£20 £30 £40 £50 £60 £70 £80 £90 

X 10 £0 30 X 

PLRVPADDVHGSHGRQVCRRHELYVSFQDLGWLVSSDSP 
I II I I I I I I I I I I I I I I 

EVHVRTTRSASSRRRQQSRNRSTQSQDVSRGSGSSDYNGSELKTACKKHELYVSFQDLGWQDWI IAPKGYAA 
300 310 3£0 330 340 350 360 

NYCDGECSFPLNAHMNATNHA I VQTL VHLMNPEYVPKPCCAPTKLNA I SVLYFDDNSNV I LKKYRNM V V RAC 
370 380 390 400 410 4£0 430 


GCH 


7. US-07-800-364A-11 < 1-1 19) 

A39£63 *Bone morphogenetic protein 5 - Human 

ENTRY A39£63 #Type Protein 

TITLE *Bone morphogenetic protein 5 - Human 

DATE 04-Oct-1991 #Sequence 04-Oct-1991 #Text 04-Oct-1991 

PLACEMENT 0.0 0.0 0.0 0.0 0.0 


COMMENT *This entry is not verified. 

SOURCE Homo sapiens #Common-name man 

REFERENCE 

tfftuthors Celeste ft. J. , Iannazzi J. ft. , Taylor R.C. , Hewick 

R.M. 9 Rosen V., Wang E.ft. , Wozney J.M. 
#Journal Proc. Natl, ftcad. Sci. U. S. ft. (199®) 87:9843-9847 
#Title Identification of transforming growth factor beta 

family members present in bone-inductive protein 

purified from bovine bone. 
#Ref erence-number A39£63 
ttftccession ft39£63 
#Cross-reference GB:M38693 
SUMMftRY #Molecular-weight 51736 #Length 454 #Checksura 4416 

SEQUENCE 

Initial Score = 11 Optimized Score = 16 Significance = 7. 83 
Residue Identity = 41% Matches = 16 Mismatches = £3 

Gaps = ® Conservative Substitutions = 0 

Translation Frame= 1 

DTRKftQftLDVGWLyFDITVTSNHWVINPQNNLGLQLCftETGDGRSINVKSftSLVGRQGPQSKQPFMVftFFKft 
£4© £50 £60 £70 £80 £90 300 

X 10 £0 30 X 

PLRVPftDDVHGSHGRGWCRRHELYVSFQDLGWLVSSDSP 
t I I I t I I I I I I I I I I I 
SEVLLRSVRftftNKRKNQNRNKSSSHQDSSRMSSVGDYNTSEQKQftCKKHELYVSFRDLGWQDWI IftPEGYflfi 
310 3£0 330 X 340 350 360 370 X 

FYCDGECSFPLNftHMNflTNHft I VGTL VHLMFPDH VPKPCCftPTKLNft I S VL YFDDSSN V I LKK YRNM V M RSC 
380 390 400 410 4£0 430 440 450 

GCH 


8. US-07-800-364ft-ll (1-119) 

C39364 *GDF-1 embryonic growth factor - Human 


ENTRY 
TITLE 
DATE 

PLACEMENT 

COMMENT 

SOURCE 

REFERENCE 
ttftuthors 
^Journal 
#Title 


C39364 #Type Protein 

*GDF-1 embryonic growth factor - Human 

03-Mar-199£ #Sequence 03-Mar-199£ #Text 03-Mar-199£ 

0.0 0.0 0.0 0.0 0.0 

*This entry is not verified. 
Homo sapiens tfCommon-name man 

Lee S. J. 

Proc. Natl, ftcad. Sci. U. S. ft. (1991) 88:4£50-4£54 
Expression of growth/differentiation factor 1 in the 
nervous system: conservation of a bicistronic 


-13- 


struct ure. 
#Ref erence-number A39364 
^Accession C39364 


1. US-07-800-364ft-ll (1-119) 

BMP6 HUMAN BONE MORPHOGENETIC PROTEIN 6 PRECURSOR (BMP-6). 


ID 

BMP6JHUMAN 

STANDARD; PRT; 513 AA. 

AC 

P22004; 


DT 

01-AUG-1991 

(REL. 19, CREATED) 

DT 

01 -AUG- 1991 

<REL. 19, LAST SEQUENCE UPDATE) 

DT 

01-AUG-1992 

(REL. £3, LAST ANNOTATION UPDATE) 

DE 

BONE MORPHOGENETIC PROTEIN 6 PRECURSOR (BMP-6). 

OS 

HOMO SAPIENS (HUMAN). 

OC 

EUKARYOTA; 

METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA; 

□C 

EUTHERIA; PRIMATES. 

RN 

Ill 


RP 

SEQUENCE FROM N. ft. 

RC 

TISSUE=BONE 

5 

RM 

91088608 


Rfl 

CELESTE ft. J., IANNAZZI J. ft. , TAYLOR R. C. , HEWICK R. M. , ROSEN V. 

RA 

WANG E.A. , 

WOZNEY J. M. ; 

RL 

PROC. NATL. 

ACAD. SCI. U.S.A. 87:9843-9847(1998). 

CC 

-!- FUNCTION: INDUCES CARTILAGE AND BONE FORMATION. 

CC 

-!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE T6F-BETA FAMILY. 

DR 

EMBL; M60315; HSTGFBC. 

DR 

PIR; B39263 

; B39263. 

DR 

PROS I TE 5 PS00250; TGFJBETA. 

KW 

SIGNAL; GROWTH FACTOR; CYTOKINE; BONE; CARTILAGE; GLYCOPROTEIN. 

FT 

SIGNAL 

1 ? POTENTIAL. 

FT 

PROPEP 

? 381 POTENTIAL. 

FT 

CHAIN 

382 513 BONE MORPHOGENETIC PROTEIN 6. 

FT 

DISULFID 

412 478 BY SIMILARITY. 

FT 

DISULFID 

441 510 BY SIMILARITY. 

FT 

DISULFID 

445 512 BY SIMILARITY. 

FT 

DISULFID 

477 477 INTERCHAIN <BY SIMILARITY). 

FT 

CARBOHYD 

241 241 POTENTIAL. 

FT 

CARBOHYD 

2&9 269 POTENTIAL. 

FT 

CARBOHYD 

386 386 POTENTIAL. 

FT 

CARBOHYD 

404 404 POTENTIAL. 

FT 

CARBOHYD 

454 454 POTENTIAL. 

SQ 

SEQUENCE ! 

513 AA; 57225 MW; 1340230 CN; 


Initial Score = 14 Optimized Score = 19 Significance = 1O.01 

Residue Identity = 58% Matches = £0 Mismatches = 19 

Gaps = 1 Conservative Substitutions = © 

Translation Frame= 1 


DTRVUWASEEGWLEFDITftTSNLWVVTPQHNMGLQLSVVTRDGVHMHPRftftGLVGRDGPYDKQPFMVflFFKV 


»14» 


3m 310 


320 


330 340 


350 


X 10 20 30 X 

PLRV-PADDVHGSH6RQVCRRHELYVSFQDL6WLVSSDSP 
II II I I I I I I I I I I I I I I I I 

SEVHVRTTRSASSRRRQQSRNRSTQSQDVARVSSASDYNS5ELKTACRKHELYVSFQDLGWQDWI IAPKGYA 
370 380 390 X 400 410 420 430 X 

ANYCDGECSFPLNAHMNATNHAI VQTLVHLMNPEYVPKPCCAPTKLNAISVLYFDDNSNVILKKYRNMVVRA 
440 450 460 470 480 490 500 

CGCH 
510 


2. US-07-800-364A-11 (1-119) 

BMP7_M0USE BONE M0RPH06ENETIC PROTEIN 7 PRECURSOR (BMP-7) <OS 

ID BMP7_M0USE STANDARD; PRT; 430 AA. 

AC P23359 ; 

DT 01-NOV-1991 <REL. £0, CREATED) 

DT 01-NOV-1991 (REL. 20, LAST SEQUENCE UPDATE) 

DT 01-AUG-1992 (REL. £3, LAST ANNOTATION UPDATE) 

DE BONE MORPHOGENETIC PROTEIN 7 PRECURSOR (BMP-7) (OSTEOGENIC PROTEIN 1) 

DE (OP1). 

GN BMP7 OR OP1. 

OS MUS MUSCULUS (MOUSE). 

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA; 

OC EUTHERIA; RODENT I A. 

RN C13 

RP SEQUENCE FROM N. A. 

RM 91354237 

RA OEZKAYNAK E. , SCHNEGELSBERG P. N. J. , OPPERMANN H. ; 

RL BIOCHEM. BIOPHYS. RES. COMMUN. 179:116-123(1991). 

CC -!- FUNCTION: INDUCES CARTILAGE AND BONE FORMATION. MAY BE THE 

CC OSTEOINDUCTIVE FACTOR RESPONSIBLE FOR THE PHENOMENON OF 

CC EPITHELIAL OSTEOGENESIS. 

CC -!- SUBUNIT: HOMODIMER, D I SULF I DE-LINKED. 

CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE T6F-BETA FAMILY. 

DR EMBL; X56906; MMOP1G. 

DR PIR; JQ1184; JQ1184. 

DR PROSITE; PS00250; TGF_BETA. 

KW SIGNAL; GROWTH FACTOR; CYTOKINE; BONE; CARTILAGE; GLYCOPROTEIN. 


FT 

SIGNAL 

1 

29 

POTENTIAL. 

FT 

PROPEP 

30 

291 

POTENTIAL. 

FT 

CHAIN 

292 

430 

BONE MORPHOGENETIC PROTEIN 7. 

FT 

DISULFID 

3£9 

395 

BY SIMILARITY. 

FT 

DISULFID 

358 

427 

BY SIMILARITY. 

FT 

DISULFID 

362 

429 

BY SIMILARITY. 

FT 

DISULFID 

394 

394 

INTERCHAIN (BY SIMILARITY). 


FT 

CARBOHYD 

186 

186 

POTENTIAL. 

FT 

CARBOHYD 

301 

301 

POTENTIAL. 

FT 

CARBOHYD 

320 

320 

POTENTIAL. 

FT 

CflRBOHYD 

371 

371 

POTENTIAL. 

SQ 

SEQUENCE 

430 AA; 

49283 

MW; 934349 CN; 


Initial Score = 14 Optimized Score = 17 Significance = 8.01 

Residue Identity = 45% Matches = 18 Mismatches = 21 

Gaps = 1 Conservative Substitutions = 0 

Translation Frane= 1 


DSRT I WASEEGWL VFDI TATSNHWV VNPRHNLGLQLS VETLDGQS I NPKLAGL I GRHGPQNKQPFM VAFFKA 
220 230 240 250 260 270 280 


X 10 20 30 X 

PLR-vPADDVHGSHGRQVCRRHELYVSFQDLGWLVSSDSP 
II I ill I II I I I I I I I I I 

TEVHLRSIRSTGGKQRSQNRSKTPKNGEALRMASVAENSSSDQRQACKKHELYVSFRDLGWQDWIIAPEGYA 
290 300 310 320 330 340 350 

AYYCEGECAFPLNSYMNATNHA I VQTLVHF I NPDT VPKPCCAPTQLNA I SVLYFDDSSNV I LKKYRNM V V RA 
360 370 380 390 400 410 420 


CGCH 
430 


Set Items Description 
?B 357, 155, WPI, 72, 35, 5, 399 


££.jan93 15:89:16 User £08701 Session D£13. 1 
tO. 14 0. 004 Hrs Filel 
$0.14 Estimated cost Filel 
$0.04 DIALNET 

$0. 18 Estimated cost this search 

*0. IS Estimated total session cost 0.004 Hrs. 


SYSTEM: OS - DIALOG OneSearch 

File 357 : DERWEIMT BIOTECHNOLOGY ABS 193£-1992/DEC 

(Copr. 1993 Derwent Pub. ltd. ) 
File 155:MEDLINE 1966-1993/MAR (9303W3) 
File 351: DERWENT WORLD PATENTS INDEX-LATEST 

1981+;DW=9248, UA=9£35, UM=9£08 
**FILE351: Format 9 includes the expanded patent table. Pref ormatt ed 
REPORTS are available. Type 7FMT351 , ?NEWS351, 7RATES351 for more info. 
File 350:Derwent World Patents Index 

1963-1980, EQUIVALENTS THRU DW=9£43 
**FILE350: Format 9 includes the expanded patent table. Preformatted 
REPORTS are available. Type 7FMT350, 7NEWS350, 7RATES350 for more info. 
File 7£ : EMBASE (EXCERPTA MEDICA) 85-93/ ISS0£ 

(COPR. ESP BV/EM 1993) 
**FILE 7£: Truncate EMTREE Codes <e. g. DC=C1.1£0?) for complete 
**retri eval . The 9£45 update has been delayed. 
File 35:DISSERTATI0N ABSTRACTS ONLINE 1861-FEB 93 

(Copr University Microfilms) 
File 5:BI0SIS PREVIEWS 69-93/ JAN BA9503 : BARRM4403 

<C. BIOSIS 1993) 

**FILE 5: Biosystemat ic Codes (BC=) for viruses have changed for 1993. 
Type 7NEWS5 for more information and a complete list of the new codes. 
File 399 :CA SEARCH 1967-199£ UD=117£6 

(COPR. 1993 BY THE AMER. CHEM. SOC. ) 
**FILE399: Use is subject to the terms of your user customer agreement. 


Set Items Description 


?S HOPS? AND OSTEOGENIC 


£ HOPS? 
6400 OSTEOGENIC 
SI 0 HOPS? AND OSTEOGENIC 

?s Glu(£W) pro (£W)his (£W)trp(SW> lys (£W> glu(£W) phe 

Processing 


£0137 

GLU 

77393 

PRO 

133357 

HIS 

15560 

TRP 

£££16 

LYS 

£0137 

GLU 

£6919 

PHE 


S£ 0 GLU(£W)PRO<SW)HIS(£W)TRP(SW)LYS<£W)GLU<SW)PHE 

?s thr <£W) asn (£W) glu(SW) leu (SW> pro (SW) pro <£W) pro (£W) asn 


$1. 58 

*6. £6 

$2. £1 

$£. 40 

$5. 17 

*£. 16 
$£6. 4£ 
$£6. fc® 


cSvlEldvtC COSC 

$1.58 0.008 
Estimated cost 
$6. £6 0.058 
Estimated cost 
*£. £1 0. 0£3 
Estimated cost 
$£.40 0. 0£5 
Estimated cost 
$5.17 0.041 
Estimated cost 


Mi eAU± 
Hrs File350 
File350 
Hrs File7£ 
File7£ 
Hrs File35 
File35 
Hrs FileS 
FileS 

Hrs File399 
Fi le399 


OneSearch, 8 files, 0. £00 Hrs FileOS 
DIALNET 

Estimated cost this search 
Estimated total session cost 


0. £04 Hrs. 


Logoff: level £9.01.08 D 15:£0:53 


DIALNET: call cleared by request 
Enter Service: 

please see your representative 

if you are having trouble logging in?? Ix9~5<.104f 
NO CARRIER 


14167 

THR 

10800 

ASN 

£0137 

GLU 

37663 

LEU 

77393 

PRO 

77393 

PRO 

77393 

PRO 

10800 

ASN 


S3 1 THR(2W> ASN(2W)GLU(2W)LEU(2W)PR0(2W)PR0(2W)PR0(2W) ASN 

?T 3/5/1 


3/5/1 <Item 1 from file: 351) 

008865227 WPI flee No: 91-369252/50 

XRAM flee No: C9 1-159207 

New BMP-8 protein - useful in inducing cartilage and/or bone formation 
to treat wounds and repair fractures and tissues, e.g. burns, incisions 
and ulcers 

Patent Assignee: (GENE-) GENETICS INST INC 
Author (Inventor): HEWICK R M; WANG J H 
Number of Patents: 001 
Number of Countries: 014 
Patent Family: 

CC Number Kind Date Week 

WO 9118098 A 911128 9150 (Basic) 


Priority Data (CC No Date): US 525357 (900516); US 641204 (910115) 
Language : Eng 1 ish 

EP and/or WO Cited Patents: l.Jnl.Ref; WO 8910409; WO 9011366 
Designated States 
(National) : CH; JP 

(Regional): AT; BE; CH; DE ; DK; ES; FR; GB; GR; IT; LU; NL; SE 
Abstract (Basic): WO 9118098 

A purified BMP-8 protein is claimed, comprising at least one of 
the following sequences; (a) 

Arg-His-Glu-Leu-Tyi — Val-Sei — Phe-Gl u-Asp-Leu-Gl y-Trp-Leu-Asp-Trp-Val -1 1 e 
~Ala-Pro-Glu~ Gly-Tyr; (b) Leu-Ser-Ala-Thn — Set — Val-Leu 
Tyr-Tyt — Asp-Sev — Sei — Asn-Asn-Val-I 1 e-Leu-Arg ; (c) 
Ala-Cys-Cys-Ala-Pro-Thr-Lys ; (d) Thr-Asn-Gl u-Leu-Pro-Pro-Pro 
Asn-Lys-Leu-Pro-Gly-I 1 e-Phe-Asp-Asp-Val-Hi s-Gly-Sei — His-Gly-Arg ; and 
(e) the sequences homologous to the above sequences. Also claimed are a 
DNA sequence encoding a BMP-8 protein, a host cell transformed with the 
DNA and a method for producing the protein. 

USE/ ADVANTAGE - BMP-8 protein induces the formation of cartilage 
and/or bone and is used to heal wounds and repair tissue (claimed), eg 
burns, incisions and ulcers. It is also used to repair fractures and 
other defects or periodontal diseases. £(50pp Dwg. No. 0/0) £ 
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1. 5« 01 1 , 691 , Apr. 30, 1991, Osteogenic devices; Hermann Oppermann, 

et al., 424/423, 426; 514/2, 21; 530/350, 395, 840 CIMAGE AVAILABLE} 

US PAT NO: 5,011,69 1 C IMAGE AVAILABLE} LI : 1 of 1 

DATE FILED: Feb. 23, 1989 

REL-US-DATA: Continuation-in-part of Ser. No. 232,630, Aug. 15, 1988, 

which is a continuation-in-part of Ser. No. 179,406, 
Apr. 8, 1988, Pat. No. 4,968,590. 

ABSTRACT: 

Disclosed are (1) osteogenic devices comprising a matrix containing 
osteogenic protein and methods of inducing endochondral bone growth in 
mammals using the devices; <2> amino acid sequence data, amino acid 
composition, solubility properties, structural features, homologies and 
various other data charact eri z ing osteogenic proteins, (3) methods of 
producing osteogenic proteins using recombinant DNA technology, and (4) 
osteogenical ly and chondrogTERMI NAL (ENTER 1, 2, 3, 4, OR ?):3 
#i6<ilpn = s pn5=01165901 
LI 1 PN=5011691 

e=l , dbcitd, fed i , tr, ef Id, , arb 

1. 5, 011 , 691 , Apr. 30, 1991, Osteogenic devices; Hermann Oppermann, 

et al., 424/423, 426; 514/2, 21; 530/350, 395, 840 E IMAGE AVAILABLE!! 

US PAT NO: 5,011,691 [IMAGE AVAILABLE] Li: 1 of 1 

DATE FILED: Feb. 23, 1989 

REL-US-DATA: Continuation-in-part of Ser. No. 232,630, Aug. 15, 1988, 

which is a continuation-in-part of Ser. No. 179,406, 
Apr. 8, 1988, Pat. No. 4,968,590. 

ABSTRACT: 

Disclosed are (1) osteogenic devices comprising a matrix containing 
osteogenic protein and methods of inducing endochondral bone growth in 
mammals using the devices; (2) amino acid sequence data, amino acid 
composition, solubility properties, structural features, homologies and 
various other data characterizing osteogenic proteins, (3) methods of 
producing osteogenic proteins using recombinant DNA technology, and (4) 
osteogenical ly and chondrogenical 1 y active synthetic protein constructs. 

= > s Glus< 2GW1) up(r2oW()2prWo) (h2i sW < ) 2hWi ) st <r2pW < ) 2tWr ) pi (y£sW < >£lWy> sg < 12uW < 

W£gW&) up <h2e 

L2 < 2966 )GLU 

L3 ( 9856) PRO 

L4 < 107016)HIS 

L5 ( 2510) TRP 

L6 ( 2973) LYS 

L7 ( 2966 )GLU 

L8 ( 3474) PHE 

L9 0 GLU <2W) PRO (2W) HIS (2W) TRP <£W> LYS (2W) GLU (2W) PHE 

= > s alas( 2aWl)ac(y2sW) (c2y Ws ) c ( y2sW ( ) 2cWy ) sa < 12aW < ) 2aWl ) ap (r2oU) < ) 2pWr ) ot <h2rW< 

MEtfclb)rl <y2s 

L10 ( 3842) ALA 

Lll ( 1903)CYS 

L12 ( 1903) CYS 

L13 < 3842) ALA 


/ 3 x 

cgtacat gcc 

ggccgtcaag 

gacggcgagt 

tcggct tcaa 

ccttct ggt g 

801 

ggcgggttca 

tcaqccccaa 

gaggt gggcc 

gaggcgttgc 

cgctcgacgc 

851 

ct gggtcgcc 

ggggacgacg 

tcgtccccgt 

gtgcaaggcc 

at cctcgagg 

901 

egtaccggga. 
fesBGLBG Y 
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L15 ( 3£0£)THR 

L16 ( £973)LYS 

L17 O ALA(£W)CYS<£W)CYS<£W)ALA<£W)PRO<£W>THR<£W)LYS 

=> 5 thrs( £tWh)ra<s£nW<)£aWs)ng<l£uW<)£gWl)ul < e£uW < ) £lWe) up <r£oW ( > £pWr ) op <r£oW) 

WpSsWo) p <r£oW < ) £pWr ) oa<s£n 

L18 < 3£0£)THR 

L19 < £046)ASN 

L£0 < £966)GLU 

LSI < 3565) LEU 

L££ ( 985&)PR0 

L£3 < 9856) PRO 

L£4 < 9856) PRO 

L£5 < £046)ASN 

L£6 0 THR(£W)ftSN<£W)6LU<£W)LEU(£W)PR0(£W)PR0<£W)PR0(£W)ASN 

?EC6??GAsA T7TGCACAGT? 
L£7 £3 7GAATTCCG? 

eECe?L£7S ALN£D7 7AANDG C7CACCGAC? 

L£8 ( 5)?AGCCCCA? 

L£9 1 L£7 AND 7AGCCCCA? 


6IL,9BCITD, FCIDT, , RFELD, , ARB 


1. 4,997,93®, Mar. 5, 1991, Cloning of complementary DNA encoding maize 
nitrite reductase; Kristine N. Lahners, et al., 536/£7; 435/69.1, 70.1, 
17£.3, £40.4, 317.1, 3£0. 1 ; 935/18, £1, 35, 64, 78, 79 [IMAGE AVAILABLE3 

US PAT NO: 4,997,930 C IMAGE AVAILABLE3 L£9 : 1 of 1 

DATE FILED: Mar. 16, 1989 

ABSTRACT: 

Maize cDNA coding for nitrite reductase is cloned, using a spinach 
nitrite reductase cDNA as a heterologous probe, and is characterized. A 
method is provided to use the cloned maize nitrite reductase cDNA to 
determine the number of nitrite reductase genes per maize genome and to 
study nitrite reductase mRNA regulation in maize. 

=KWD>CKWIDC 
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CLAIMS: 

CLMS<£) 

£. An isolated DNA sequence according to claim 1, wherein the sequence 
comprises : 


1 qaattccqqq 

ccgcacaggg 

cgcgcccgcg 

cggccgtctc 

cgt gccgccg 

51 ccggcggggg 

agcaggtccc 

gacggagcgg 

ctggagccga 

gggtcgagga 

101 

tgtacgagca 

cccgcacatc 

aacgacctcg 
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